• AI and ML HPC Cluster…

    NVIDIA (Santa Clara, CA)
    …that power some of the world's most advanced computing workloads. NVIDIA is looking for an AI /ML HPC Cluster Engineer to join our MARS team. You will provide ... be doing: + Support day-to-day operations of production on-premises and multi-cloud AI / HPC clusters, ensuring system health, user satisfaction, and efficient… more
    NVIDIA (01/10/26)
    - Related Jobs
  • Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are responsible for the big picture of how ... fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU… more
    NVIDIA (01/13/26)
    - Related Jobs
  • Senior GPU and HPC Infrastructure…

    NVIDIA (Santa Clara, CA)
    NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations, ... and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly… more
    NVIDIA (01/08/26)
    - Related Jobs
  • Staff Quality and Reliability

    Google (Sunnyvale, CA)
    …architecture and its integration within AI /ML-driven systems. As a Quality and Reliability Engineer for Google Cloud, you will lead the development of ... Staff Quality and Reliability Engineer , Google Cloud _corporate_fare_ Google...Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability more
    Google (12/30/25)
    - Related Jobs
  • Principal Software Engineer , Networking…

    Oracle (Sacramento, CA)
    AI Infrastructure Innovation team is pioneering the creation of next-generation AI / HPC networking for GPU superclusters at massive scale. Our mission is ... system design, and implementation for high-performance RDMA solutions across OCI's AI / HPC platforms, including frontend and backend fabrics. + Innovate… more
    Oracle (12/20/25)
    - Related Jobs
  • AI /ML Infrastructure Engineer

    Oracle (Sacramento, CA)
    …solutions across Oracle's enterprise customers. We are seeking a highly skilled ** AI /ML Infrastructure Engineer ** to design, build, and support the systems, ... troubleshooting, and best practices. + Stay current with emerging trends in AI infrastructure, agent frameworks, HPC systems, and cloud-native technologies;… more
    Oracle (01/13/26)
    - Related Jobs
  • Principal Network Engineer - DC…

    NVIDIA (Santa Clara, CA)
    …a passionate engineer who will solve networking problems for scalable AI clusters. This is a hands-on network engineering position focused on the architecture, ... and deployment of global-scale DCs inter-connects and fabric for HPC , AI , and GPU computing clusters. +...reliability . + Partner with system, OS, GPU, and HPC teams to deliver scalable, highly available networks for… more
    NVIDIA (01/10/26)
    - Related Jobs
  • Senior Principal Software Development…

    Oracle (Sacramento, CA)
    …Forward Deployed Engineer (FDE) team is hiring a Senior Principal Software Development Engineer - AI Data Platform to help global customers unlock the full ... to streamline the adoption of Oracle AI Data Platform and Gen AI services. + Optimize performance, scalability, and reliability of distributed data/ AI more
    Oracle (01/11/26)
    - Related Jobs
  • Consulting Member of Technical Staff - AI

    Oracle (Santa Clara, CA)
    …and debug software programs for databases, applications, tools, networks etc.As an AI /ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, ... or Scala + Proven experience designing, implementing, and managing infrastructure for AI /ML or HPC workloads. + Understanding machine learning frameworks and… more
    Oracle (12/05/25)
    - Related Jobs
  • Senior AI /ML Infrastructure…

    General Motors (Sunnyvale, CA)
    **Job Description** **About the Team:** The ** AI Validation Platform** team owns the cloud-agnostic, reliable, and cost-efficient platform that powers GM's AV ... the Role:** We are seeking a Senior ML Infrastructure engineer to help build and scale robust Compute platforms...of cutting-edge GPUs, while also leveling up the platform's reliability . The successful candidate will have experience building and… more
    General Motors (01/07/26)
    - Related Jobs