  • Senior ML Platform Engineer, AI…

    NVIDIA (Santa Clara, CA)
    …doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC ... are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to…
    NVIDIA (08/21/25)
  • Site Reliability Engineer

    LTD Global (Berkeley, CA)
    …computing (HPC) and data analysis for the organization. Our center provides essential HPC and data systems to more than 10,000 researchers working in areas ... Position overview: We are seeking a Site Reliability Engineer to join our Operations Group. This role ... part of a 24/7 operations team that ensures our systems are accessible, reliable, secure, and available to the…
    LTD Global (09/23/25)
  • IT Storage Engineer

    SpaceX (Hawthorne, CA)
    …low-latency storage fabrics using RoCEv2 and/or InfiniBand for compute-intensive environments. + Engineer and support parallel file systems (e.g., VAST, Lustre, ... IT Storage Engineer Hawthorne, CA Apply SpaceX was founded under ... expertise in both enterprise storage platforms and high-performance computing (HPC) storage environments. This role is ideal for someone…
    SpaceX (09/14/25)
  • Senior Storage Performance Engineer

    NVIDIA (Santa Clara, CA)
    …need to see: + 12+ years of experience in performance engineering, benchmarking, or HPC/AI systems. + Deep expertise in AI/ML and deep learning frameworks ... in search of a highly skilled Senior Storage Performance Engineer to join our ambitious team in Santa Clara, ... we continue to push the boundaries of AI and HPC technologies. You will have the chance to create,…
    NVIDIA (09/25/25)
  • Senior High Performance Computing Engineer

    SLAC National Accelerator Laboratory (Menlo Park, CA)
    Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **About SLAC:** The SLAC National ... hybrid work options.** **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing Services Division of the Technology and…
    SLAC National Accelerator Laboratory (07/26/25)
  • Senior GPU Supercomputer Scheduler Engineer

    NVIDIA (Santa Clara, CA)
    …to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads ... be doing: + Design and develop enhancements to the HPC batch scheduler(s). + Work extensively with HPC ... and tools + Accomplished in computer architecture and operating systems + Experience analyzing and tuning performance for a…
    NVIDIA (08/20/25)
  • Senior Site Reliability Engineer - Storage

    NVIDIA (Santa Clara, CA)
    …wave of artificial intelligence. Join our team at NVIDIA as a Senior Site Reliability Engineer focused on HPC storage and play a crucial role in designing, ... implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for…
    NVIDIA (08/21/25)
  • Senior System Software Engineer, NCCL…

    NVIDIA (Santa Clara, CA)
    …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking ... Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,…
    NVIDIA (07/07/25)
  • Lead IT Engineer for High Performance…

    Honeywell (San Jose, CA)
    As a **Lead IT Engineer for High Performance Computing (HPC)** here at Honeywell, you will be at the forefront of our technology initiatives, driving the design ... in optimizing our computing resources and ensuring that our systems operate at peak efficiency. Honeywell's HPC infrastructure spans hundreds of computers in different regions…
    Honeywell (09/24/25)
  • Modeling & Simulation Systems

    Northrop Grumman (Los Angeles, CA)
    …ability to get cleared to SAP access level. **Basic Qualifications for a Principal Systems Engineer, Modeling and Simulation Systems/Software - (Level 03):** ... specific to modeling and simulation. **Basic Qualifications for an Engineer, Modeling and Simulation Systems/Software - (Level ... running Monte Carlo simulations + Experience working with an HPC system + Experience with hardware in the loop…
    Northrop Grumman (08/22/25)