• Senior AI - HPC

    NVIDIA (Santa Clara, CA)
    …of experience crafting and operating large scale compute infrastructure. + Experience with AI / HPC job schedulers and orchestrators, such as Slurm, K8s or LSF. ... Applied experience with AI / HPC workflows that use MPI and NCCL. + Proficient in using Linux including CentOS/RHEL and/or Ubuntu Linux distributions. A solid… more
    NVIDIA (10/30/25)
  • Senior AI and ML HPC

    NVIDIA (Santa Clara, CA)
    …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... years of experience designing and operating large scale compute infrastructure + Experience with AI / HPC advanced job schedulers, such as Slurm, K8s, PBS, RTDA or… more
    NVIDIA (10/19/25)
  • Senior HPC Cluster Engineer…

    NVIDIA (Santa Clara, CA)
    …a lasting impact on the world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for EDA ... experience crafting and operating large scale compute infrastructure, including cluster configuration management tools such as BCM or Ansible. + Experience with AI / HPC job schedulers and orchestrators, such as… more
    NVIDIA (12/10/25)
  • Senior Solutions Architect, Cluster

    NVIDIA (Santa Clara, CA)
    …and reference material Ways to stand out from the crowd: + Experience leading large-scale AI Factory or HPC cluster bring-ups or builds + Hands-on experience ... world's most groundbreaking and innovative accelerated computing platforms for AI and HPC. Because of our work,...world's fastest supercomputers. We are seeking a highly motivated Senior Solutions Architect to join the Cluster… more
    NVIDIA (12/04/25)
  • Senior HPC Systems Engineer

    Massachusetts Institute of Technology (Cambridge, MA)
    Senior HPC Systems Engineer + Job Number: 25342 + Functional Area: Information Technology + Department: MA Green High Performance Computing Ctr + School Area: MA ... SENIOR HPC SYSTEMS ENGINEER, The Massachusetts Green...role will be responsible for deploying, maintaining, and optimizing HPC clusters, storage systems, and networking for AI… more
    Massachusetts Institute of Technology (12/04/25)
  • Senior GPU and HPC Infrastructure…

    NVIDIA (Santa Clara, CA)
    NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations, ... and planning abilities. Experience working with High Performance Computing (HPC), GPUs, and high-performance networking (RDMA, InfiniBand, RoCE) are strongly… more
    NVIDIA (10/09/25)
  • Senior Systems Engineer - High-Performance…

    NVIDIA (Santa Clara, CA)
    Join the NVIDIA Deep Learning Frameworks Infrastructure team as a Senior Systems Engineer focusing on High-Performance AI & Networking Applications, committed to ... equivalent experience. + 8+ years of proven experience in AI / HPC Infrastructure. + Familiarity with AI...NCCL, NIXL, NVSHMEM, UCX. + Experience developing or maintaining cluster management and monitoring tools, e.g., Ansible for infrastructure… more
    NVIDIA (11/11/25)
  • Senior Software Engineer, AI

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently ... Senior Software Engineer to lead the development of AI software resiliency for the most powerful AI...GPUs. Your expertise will be crucial in driving down cluster downtime towards zero, ensuring that our AI… more
    NVIDIA (10/15/25)
  • Senior Solutions Architect, NVIDIA Cloud…

    NVIDIA (Santa Clara, CA)
    …with NVIDIA hardware (such as GPUs, ETH/IB networking components, storage, etc.) within extensive AI and HPC cluster settings. + Practical knowledge of ... expertise in data center design, development and execution for AI and HPC. + Efficient time management...AI benchmarking, and more. + Practical involvement in cluster administration and coordination (SLURM, K8s, etc.). We have… more
    NVIDIA (12/02/25)
  • Senior Solutions Architect, NPN

    NVIDIA (Durham, NC)
    …a hardworking Solution Architect with experience in designing, building, and maintaining large scale HPC and AI hybrid computing solutions to join our team at ... (or equivalent experience). + Established track record working with AI and HPC clusters, both on-premises and...based. + 4 plus years of proven experience with cluster management and related tools, including Docker Containers, Slurm,… more
    NVIDIA (10/19/25)