• Senior Systems Engineer - High

    NVIDIA (Santa Clara, CA)
    …Learning Frameworks Infrastructure team as a Senior Systems Engineer focusing on High - Performance AI & Networking Applications, committed to ground-breaking AI & ... exposure to AI/ HPC workflows employing MPI and NCCL. + Familiarity with High -Speed Networking pertaining to HPC including InfiniBand, RDMA, RoCE, and Amazon… more
    NVIDIA (11/11/25)
    - Related Jobs
  • Intern: Hybrid Cloud and Quantum Research…

    IBM (San Jose, CA)
    …areas in the context of hybrid cloud, AI systems, networking, security, high -speed networked-storage, accelerators, and HPC principles. The selected candidate ... with executing HPC workloads * Familiarity with HPC system performance evaluation. At IBM, we...HPC : experience running HPC workloads on HPC systems * Quantum Computing : experience running… more
    IBM (11/21/25)
    - Related Jobs
  • Sr. ML Kernel Performance Engineer, AWS…

    Amazon (Cupertino, CA)
    …offers a unique opportunity to work at the intersection of machine learning, high - performance computing , and distributed architectures, where you'll help ... work on cutting-edge products at the intersection of machine-learning, high - performance computing , and distributed architectures....NVIDIA PTX and/or AMD GPU ISA - Experience developing high performance libraries for HPC more
    Amazon (11/14/25)
    - Related Jobs
  • Intern 2026: AI Systems Research Scientist

    IBM (San Jose, CA)
    …areas in the context of hybrid cloud, AI systems, networking, security, high -speed networked-storage, accelerators, and HPC principles. The selected candidate ... you'll join a team who invent what's next in computing , always choosing the big, urgent and mind-bending work...design * Experience with GPU Systems * Familiarity with HPC system performance evaluation. * Familiarity with… more
    IBM (11/22/25)
    - Related Jobs
  • Sr. System Development Engineer, High

    Amazon (Cupertino, CA)
    …and operate next-generation infrastructure that powers breakthrough innovation in AI/ML and HPC workloads. If you're passionate about pushing the limits of ... performance , efficiency, and scalability in the cloud, this is...through server conception, design, test, launch, and operations. Driving high quality and reliability into future/new designs for AWS… more
    Amazon (10/25/25)
    - Related Jobs
  • Summer Intern - Computational Sciences Center…

    Genentech (South San Francisco, CA)
    …We're seeking a PhD/Master's student with expertise and passion for performance -aware scientific computing , particularly in machine learning systems. In ... performance -aware algorithms that scale on multi-node clusters. + Develop and optimize high - performance GPU kernels, and make trade offs to maximize hardware… more
    Genentech (12/19/25)
    - Related Jobs
  • Senior Math Libraries Engineer - Sparsity in AI

    NVIDIA (Santa Clara, CA)
    …to the design and development of libraries and tools to simplify and accelerate computing for unstructured sparsity in DL and HPC . Around the world, leading ... engineering simulations, using data centers powered by GPUs and high - performance linear algebra libraries. Applications of these...and develop a C++-based system to simplify and accelerate computing for unstructured sparsity in DL and HPC more
    NVIDIA (11/18/25)
    - Related Jobs
  • Software Engineer, SystemML - Scaling…

    Meta (Menlo Park, CA)
    …learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing , performance optimizations, or ... large-scale GPU training and inference fleet through an observable, reliable and high - performance distributed AI/GPU communication stack. Currently, one of the… more
    Meta (12/20/25)
    - Related Jobs
  • Senior System Software Engineer, NCCL - Partner…

    NVIDIA (Santa Clara, CA)
    NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, ... supporting HPC or AI + Practical experience with high performance networking: Infiniband/RoCE/Ethernet networks, RDMA, topologies, congestion control… more
    NVIDIA (10/06/25)
    - Related Jobs
  • Software Engineering Manager - GPU Communications…

    NVIDIA (Santa Clara, CA)
    …or equivalent experience. + Prior systems software or communication runtime or high performance networking software development experience with a successful ... libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC . DL and HPC applications have a...CUDA, MPI, OpenMP, OpenACC, pthreads. + Background with RDMA, high - performance networking technologies (InfiniBand, RoCE, Ethernet, EFA),… more
    NVIDIA (11/20/25)
    - Related Jobs