• Research Scientist, AI & Systems

    Meta (Menlo Park, CA)
    …on existing accelerator systems and guiding the future of models and AI HW at Meta. This drives improved performance , new model architectures and ... the following areas: Accelerators/GPU architectures, High Performance Computing ( HPC ), Machine Learning Compilers, Training/Inference ML Systems , Model… more
    Meta (12/20/25)
    - Related Jobs
  • Senior AI Performance and Efficiency…

    NVIDIA (Santa Clara, CA)
    …Understanding of fast, distributed storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning frameworks like PyTorch and ... We are seeking a Senior AI /ML Performance and Efficiency Engineer, GPU...to end + Debugging and optimization experience with NSight Systems and NSight Compute + Experience with debugging large-scale… more
    NVIDIA (11/04/25)
    - Related Jobs
  • Sr. System Development Engineer, High-…

    Amazon (Cupertino, CA)
    …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...the limits of performance , efficiency, and scalability in the cloud, this is… more
    Amazon (10/25/25)
    - Related Jobs
  • Sr Hardware Development Engineer, High…

    Amazon (Cupertino, CA)
    …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...the limits of performance , efficiency, and scalability in the cloud, this is… more
    Amazon (11/05/25)
    - Related Jobs
  • Principal Software Engineer, Networking…

    Oracle (Sacramento, CA)
    …what's possible. Responsibilities + Lead architecture, system design, and implementation for high- performance RDMA solutions across OCI's AI / HPC platforms, ... If you thrive at the intersection of large-scale distributed systems , high-speed networking, and AI workloads, this... performance tuning at scale. + Familiarity with AI / HPC stacks and workloads: NCCL/RCCL/MPI, Slurm or… more
    Oracle (12/20/25)
    - Related Jobs
  • Systems Development Eng (AWS Generative…

    Amazon (Cupertino, CA)
    …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more
    Amazon (12/10/25)
    - Related Jobs
  • Architect, AI Compute, OCI, NA

    Oracle (Sacramento, CA)
    …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
    Oracle (01/01/26)
    - Related Jobs
  • Senior Software Engineer, AI Resiliency

    NVIDIA (Santa Clara, CA)
    …Production Deployments: Assist in debugging and performance tuning large-scale AI workloads in cloud and HPC environments, ensuring seamless operation ... AI clusters, HPC environments, or cloud-based AI workloads . + Strong systems programming skills and experience with low-level performance tuning.… more
    NVIDIA (10/15/25)
    - Related Jobs
  • Senior Math Libraries Engineer - Sparsity…

    NVIDIA (Santa Clara, CA)
    …out from the crowd: + Strong understanding of sparse computations, in particular sparsity in AI and HPC + Good understanding of LLMs, Deep Learning methods and ... to simplify and accelerate computing for unstructured sparsity in DL and HPC . Around the world, leading commercial and academic organizations are revolutionizing … more
    NVIDIA (11/18/25)
    - Related Jobs
  • Principal Software Engineer - AI Infra…

    Oracle (Santa Clara, CA)
    …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
    Oracle (11/25/25)
    - Related Jobs