- NVIDIA (Santa Clara, CA)
- …make a lasting impact on the world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters ... + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop… more
- NVIDIA (Santa Clara, CA)
- …+ Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop ... and operating large scale compute infrastructure. + Experience with AI/ HPC job schedulers and orchestrators, such as Slurm, K8s...such as Slurm, K8s or LSF. Applied experience with AI/ HPC workflows that use MPI and NCCL. + Proficient… more
- NVIDIA (Santa Clara, CA)
- …Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop… more
- NVIDIA (Westford, MA)
- …that will be integrated with diverse quantum computing platforms. The lead HPC Engineer will offer technical mentorship, system administration, optimizing ... As the HPC Operations Engineer for the new...As the HPC Operations Engineer for the new Accelerated Quantum Center (https://www.nvidia.com/en-us/solutions/quantum-computing/accelerated-quantum-center/)… more
- Texas A&M University System (College Station, TX)
- Job Title Senior HPC Engineer Agency Texas A&M University Department Technology Services - IT Enterprise Operations Proposed Minimum Salary Commensurate Job ... members' faculty and staff providing cutting-edge research and super computing needs. As a Senior High Performance Computing Engineer ( HPC ), you will provide… more
- University of Pennsylvania (Philadelphia, PA)
- …programs and resources, and much more. Posted Job Title HPC Systems Engineer Job Profile Title Systems Administrator Senior Job Description Summary The Penn ... and motivated High Performance Computing ( HPC ) Systems Engineer to join the team. PARCC's main cluster...systems team. Job Description Job Responsibilities + Collaborate with senior staff to design, plan, test, and implement advanced… more
- NVIDIA (Santa Clara, CA)
- …and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly ... will be harnessing multiple data streams, ranging from GPU hardware diagnostics to cluster and network telemetry. + Work on software that manages NVLINK topography… more
- NVIDIA (Santa Clara, CA)
- Join the NVIDIA Deep Learning Frameworks Infrastructure team as a Senior Systems Engineer focusing on High-Performance AI & Networking Applications, committed to ... for internal teams and external partners on standard methodologies in HPC networking deployments. + Share insights on improving networking strategies for… more
- NVIDIA (Santa Clara, CA)
- …artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, ... software + Experience with RDMA (InfiniBand or RoCE) fabrics + Background with HPC cluster management tools such as Slurm, PBS, LSF, etc. + Passionate and… more
- NVIDIA (MA)
- …with an interest in advancing artificial intelligence (AI) and high-performance computing ( HPC ) in academic and research environments? We are looking for a Solutions ... background in building and deploying research computing clusters, deploying AI and HPC workloads, and optimizing system performance at scale. What you'll be doing:… more