• AI and ML HPC Cluster

    NVIDIA (Santa Clara, CA)
    …that power some of the world's most advanced computing workloads. NVIDIA is looking for an AI / ML HPC Cluster Engineer to join our MARS team. You ... including performance analysis and optimizations + Analyze and optimize cluster efficiency, job fragmentation, and GPU waste to meet...ahead of emerging technologies and effective approaches in the HPC and AI / ML infrastructure fields.… more
    NVIDIA (01/03/26)
    - Related Jobs
  • Senior AI and ML HPC

    NVIDIA (Santa Clara, CA)
    …for continual learning and staying ahead of emerging technologies and effective approaches in the HPC and AI / ML infrastructure fields. Ways to stand out from ... join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership...including developing scalable automation solutions + Build and maintain AI and ML heterogeneous clusters on-premises and… more
    NVIDIA (10/19/25)
    - Related Jobs
  • PCIe QA Engineer

    Broadcom (San Jose, CA)
    …with L2/L3 protocols especially RoCE( RDMA over Converged Ethernet ) protocol & use cases in AI / ML , HPC cluster is a plus + Having Knowledge of deep ... PCI-E-based designs, and hands-on experience in Python programming. Good understanding of AI / ML clusters, Deep learning models, and GPU Micro benchmarks is a… more
    Broadcom (11/06/25)
    - Related Jobs
  • Senior AI - HPC Cluster

    NVIDIA (Santa Clara, CA)
    …for continual learning and staying ahead of new technologies and effective approaches in the HPC and AI / ML infrastructure fields. Ways to stand out from the ... experience crafting and operating large scale compute infrastructure. + Experience with AI / HPC job schedulers and orchestrators, such as Slurm, K8s or LSF.… more
    NVIDIA (10/30/25)
    - Related Jobs
  • Technical Program Manager, AI Network Infra

    Meta (Menlo Park, CA)
    …stack, Network Hardware (NICs, Optics & Switches) 20. Experience Developing & Delivering AI Cluster Solutions for training & inference use cases **Preferred ... AI product introductions and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps . They will be responsible… more
    Meta (12/20/25)
    - Related Jobs
  • Product Manager, AI Platform Kernels…

    NVIDIA (Santa Clara, CA)
    NVIDIA's AI Software Platforms team seeks a technical product manager to accelerate next-generation inference deployments through innovative libraries, communication ... on the NVIDIA Platform, and push the boundaries of what is possible with their AI deployments! For Inference, we are the champions inside NVIDIA for AI more
    NVIDIA (12/10/25)
    - Related Jobs
  • Solutions Architect - NVIDIA Cloud Partners

    NVIDIA (Santa Clara, CA)
    …with NVIDIA hardware (such as GPUs, ETH/IB networking components, storage, etc.) within extensive AI and HPC cluster settings. + Practical knowledge of ... bridge the gap between design and deployment of large-scale AI and HPC GPU infrastructure. Do you...to be part of the team that brings GenAI, AI , ML , etc. hardware and software technologies… more
    NVIDIA (12/16/25)
    - Related Jobs
  • Senior Network Development Engineer

    Oracle (Sacramento, CA)
    …force, driving the development and design of state-of-the-art RDMA clusters tailored specifically for AI , ML , HPC workloads. We strive to be the go-to ... Org strives to be global leaders in the RDMA cluster networking domain and enable seamless, accelerated High-Performance Compute...leveraging our deep understanding of the unique demands of AI / ML and HPC applications. By… more
    Oracle (12/13/25)
    - Related Jobs
  • Network Development Engineer

    Oracle (Sacramento, CA)
    …force, driving the development and design of state-of-the-art RDMA clusters tailored specifically for AI , ML , HPC workloads. We strive to be the go-to ... Org strives to be global leaders in the RDMA cluster networking domain and enable seamless, accelerated High-Performance Compute...leveraging our deep understanding of the unique demands of AI / ML and HPC applications. By… more
    Oracle (11/25/25)
    - Related Jobs
  • Senior Software Engineer - Storage

    NVIDIA (Santa Clara, CA)
    …and tools that enable researchers and engineers to develop the next generation of AI / ML systems. By joining us, you'll help design solutions that power some ... of GPUs and petabytes of storage in multi-region clusters. + Collaborate with AI / ML research teams to understand their requirements and translate them into… more
    NVIDIA (12/02/25)
    - Related Jobs