• Senior AI - HPC

    NVIDIA (Santa Clara, CA)
    …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... years of experience designing and operating large scale compute infrastructure + Experience with AI / HPC advanced job schedulers, such as Slurm, K8s, RTDA or LSF… more
    NVIDIA (04/02/25)
    - Related Jobs
  • Senior AI - HPC Storage…

    NVIDIA (Santa Clara, CA)
    …solutions on any of the leading Cloud environment [AWS, Azure or GCP] + Experience with AI / HPC cluster job schedulers such as SLURM, LSF + In depth ... InfiniBand with IBOIP and RDMA + Background with Software Defined Networking and AI / HPC cluster networking + Familiarity with deep learning frameworks like… more
    NVIDIA (02/05/25)
    - Related Jobs
  • Senior Observability Architect, AI

    NVIDIA (Santa Clara, CA)
    …, HW, and SW engineering and research teams to define a vision and roadmap for AI / HPC cluster observability. + Architect and lead teams to develop, test, and ... NVIDIA's Hardware Infrastructure organization is seeking a Senior or Princip al Data and Observability Architect....vision and roadmap for distributed observability systems for large-scale AI and HPC clusters and workloads and… more
    NVIDIA (02/13/25)
    - Related Jobs
  • Senior Site Reliability Engineer,…

    NVIDIA (Santa Clara, CA)
    …a variety of HPC or EDA workloads. + Solid understanding of cluster configuration managements tools such as Ansible. + Proficiency in Perl for maintaining legacy ... NVIDIA is the leader in AI , machine learning and datacenter acceleration. NVIDIA is...and support workload and resource schedulers in a large-scale HPC environment. + Automate Everything: Develop automation scripts to… more
    NVIDIA (04/04/25)
    - Related Jobs
  • Senior Site Reliability Engineer…

    NVIDIA (Santa Clara, CA)
    …Make the choice, join our diverse team today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation of ... You will also be maintaining and building deep learning AI - HPC GPU clusters at scale and supporting...cluster . + Deep understanding of GPU computing and AI infrastructure. + Passion for solving complex technical challenges… more
    NVIDIA (03/26/25)
    - Related Jobs
  • Senior Software Engineer, AI

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI . We are currently ... Senior Software Engineer to lead the development of AI software resiliency for the most powerful AI...GPUs. Your expertise will be crucial in driving down cluster downtime towards zero, ensuring that our AI more
    NVIDIA (03/19/25)
    - Related Jobs
  • Senior Research Engineer, Foundation Model…

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the ... works on multimodal foundation models, large-scale robot learning, embodied AI , and physics simulation. Our past projects include Eureka… more
    NVIDIA (03/08/25)
    - Related Jobs
  • Senior Technical Program Manager - GPU…

    NVIDIA (Santa Clara, CA)
    …development and large scale distributed computing + Experience managing large scale HPC and/or AI Infrastructure deployments that stretch across hardware and ... Hardware Infrastructure is seeking a Senior Technical Program Manager to lead the strategy...infrastructure we build and operate enables NVIDIAs most advanced AI and hardware researchers and engineers to create the… more
    NVIDIA (03/27/25)
    - Related Jobs
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    …telemetries, scale out cluster , test plan development, track record in developing AI tools and NLP, DevOps, CI/CD experience to join our platform SWQA team. What ... We are passionate about markets include gaming, automotive, vision, HPC , datacenters and networking in addition to our traditional...OEM business. NVIDIA is also well positioned as the ' AI Computing Company', and NVIDIA GPUs are the brains… more
    NVIDIA (04/16/25)
    - Related Jobs