• Senior Software Engineer - Parallel…

    NVIDIA (Santa Clara, CA)
    …, an advanced compiler that sits at the intersection of compiler technology and high - performance computing . You'll work closely with the PyTorch Core team ... with multi-threading, OpenMP, CUDA, MPI, NCCL, NVSHMEM, or other parallel computing technologies. + Shown experience with low-level performance optimization… more
    NVIDIA (09/05/25)
    - Related Jobs
  • Solutions Architect, AI Hyperscalers

    NVIDIA (Santa Clara, CA)
    …vector databases, and distributed training or inference workloads. + Experience or background in HPC ( High Performance Computing ) environments for AI or ... NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by… more
    NVIDIA (10/09/25)
    - Related Jobs
  • Principal Cloud Software Engineer - Storage

    Microsoft Corporation (Aliso Viejo, CA)
    …**Principal** **Software Engineer** **for** **SSD solutions(Solid-State Drive)** in the AI and HPC ( High - Performance Computing ) fleet with a passion ... efficiencyforstorageoperationsintheproduction fleet + Collaborate with suppliers to design reliable, high performance and quality storage devices + Analyze… more
    Microsoft Corporation (10/27/25)
    - Related Jobs
  • Senior Systems Engineer - High

    NVIDIA (Santa Clara, CA)
    …Learning Frameworks Infrastructure team as a Senior Systems Engineer focusing on High - Performance AI & Networking Applications, committed to ground-breaking AI & ... exposure to AI/ HPC workflows employing MPI and NCCL. + Familiarity with High -Speed Networking pertaining to HPC including InfiniBand, RDMA, RoCE, and Amazon… more
    NVIDIA (11/11/25)
    - Related Jobs
  • Intern: Hybrid Cloud and Quantum Research…

    IBM (San Jose, CA)
    …areas in the context of hybrid cloud, AI systems, networking, security, high -speed networked-storage, accelerators, and HPC principles. The selected candidate ... with executing HPC workloads * Familiarity with HPC system performance evaluation. At IBM, we...HPC : experience running HPC workloads on HPC systems * Quantum Computing : experience running… more
    IBM (10/19/25)
    - Related Jobs
  • Senior Software Manager - Data Center…

    NVIDIA (Santa Clara, CA)
    …NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High - Performance Computing and Visualization. The GPU, our invention, ... has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique...Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're searching for a highly motivated,… more
    NVIDIA (10/21/25)
    - Related Jobs
  • Sr. ML Kernel Performance Engineer, AWS…

    Amazon (Cupertino, CA)
    …offers a unique opportunity to work at the intersection of machine learning, high - performance computing , and distributed architectures, where you'll help ... work on cutting-edge products at the intersection of machine-learning, high - performance computing , and distributed architectures....NVIDIA PTX and/or AMD GPU ISA - Experience developing high performance libraries for HPC more
    Amazon (11/14/25)
    - Related Jobs
  • Senior Math Libraries Engineer - Sparsity in AI

    NVIDIA (Santa Clara, CA)
    …to the design and development of libraries and tools to simplify and accelerate computing for unstructured sparsity in DL and HPC . Around the world, leading ... engineering simulations, using data centers powered by GPUs and high - performance linear algebra libraries. Applications of these...and develop a C++-based system to simplify and accelerate computing for unstructured sparsity in DL and HPC more
    NVIDIA (08/19/25)
    - Related Jobs
  • Sr. System Development Engineer, High

    Amazon (Cupertino, CA)
    …and operate next-generation infrastructure that powers breakthrough innovation in AI/ML and HPC workloads. If you're passionate about pushing the limits of ... performance , efficiency, and scalability in the cloud, this is...through server conception, design, test, launch, and operations. Driving high quality and reliability into future/new designs for AWS… more
    Amazon (10/25/25)
    - Related Jobs
  • Software Engineer, SystemML - Scaling…

    Meta (Menlo Park, CA)
    …learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing , performance optimizations, or ... large-scale GPU training and inference fleet through an observable, reliable and high - performance distributed AI/GPU communication stack. Currently, one of the… more
    Meta (11/05/25)
    - Related Jobs