• AI Factory Digital Twin Engineer

    NVIDIA (Santa Clara, CA)
    …in AI/ HPC data center cooling, including immersion and two-phase systems . + Experience building predictive digital twin frameworks combining physical modeling ... NVIDIA's AI Factories are built to accelerate AI and HPC workloads. At their core the Digital Twin (physics-based...tokens per watt across GPUs, cooling, power, and control systems . We are seeking a Senior AI Factory Digital… more
    NVIDIA (10/22/25)
    - Related Jobs
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …The data center platforms like GB200 NVL72 by NVIDIA are redefining AI, HPC , and cloud computing. To accommodate leading workloads globally, our diagnostic ... systems need to evolve across diverse hardware technologies. We're...We're in search of a visionary technical leader to engineer and propel innovation in diagnostics for NVIDIA's partner… more
    NVIDIA (09/10/25)
    - Related Jobs
  • Senior Math Libraries Engineer , CPU…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for an expert software engineer to help us deliver CUDA-X libraries across the NVIDIA CPU and GPU ecosystem. For over a decade, NVIDIA's ... accelerated computing platform has revolutionized HPC and AI with applications ranging from COVID-19 research...domain expert by continuously surveying current trends in software systems . What we need to see: + PhD or… more
    NVIDIA (09/26/25)
    - Related Jobs
  • Senior Software Engineer , MathDx…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for an expert software engineer to help us expand our catalog of Device eXtension (Dx) APIs for our math libraries. For over a decade, NVIDIA's ... accelerated computing platform has revolutionized HPC and AI with applications ranging from COVID-19 research...domain expert by continuously surveying current trends in software systems . What we need to see: + PhD or… more
    NVIDIA (09/20/25)
    - Related Jobs
  • Senior Software Validation Engineer

    NVIDIA (Santa Clara, CA)
    …Docker containers & Jenkins pipelines + Certifications in storage (eg, SNIA) or HPC systems or Storage Performance experience with mdtest or FIO tool. ... be. We are looking for a Senior Software Validation Engineer to lead software validation activities in the Datacenter...streamlining our testing processes. + Validation of distributed Storage systems (eg, Lustre) on AI/ HPC Datacenter scale… more
    NVIDIA (10/15/25)
    - Related Jobs
  • Software Engineer , SystemML - Scaling…

    Meta (Menlo Park, CA)
    …Communications Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and is on ... of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - Scaling / Performance Responsibilities: 1. Enabling reliable… more
    Meta (11/05/25)
    - Related Jobs
  • Base Command Manager Engineer - Nvis NPI

    NVIDIA (Santa Clara, CA)
    …10+ years of experience in at least two of the following: HPC /large-scale cluster administration, Linux systems engineering, infrastructure automation (eg, ... the world. We are seeking a dedicated Base Command Manager (BCM) Engineer to support product deployments/escalations and collaborate with Engineering and our Field… more
    NVIDIA (08/24/25)
    - Related Jobs
  • Principal Firmware Engineer - Server…

    NVIDIA (Santa Clara, CA)
    NVIDIA data center systems , such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring ... Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technical...the system software level. Including firmware, kernel drivers, operating systems , and user mode drivers. You will work with… more
    NVIDIA (11/12/25)
    - Related Jobs
  • Sr. ML Kernel Performance Engineer , AWS…

    Amazon (Cupertino, CA)
    …base. Working at the intersection of software, hardware, and machine learning systems , you'll bring expertise in low-level optimization, system architecture, and ML ... (design patterns, reliability and scaling) of new and existing systems experience - 5+ years of full software development...experience - Expertise in accelerator architectures for ML or HPC such as GPUs, CPUs, FPGAs, or custom architectures… more
    Amazon (11/14/25)
    - Related Jobs
  • Senior Software Engineer , Networking…

    NVIDIA (Santa Clara, CA)
    …develops the next generation simulation framework that spans across multiple Networking Operating Systems related to HPC , Ethernet AI, and more. We expect you ... NVIDIA is looking for a highly motivated, creative, and passionate Software Engineer to design and develop a simulation software to integrate with many networking… more
    NVIDIA (10/27/25)
    - Related Jobs