• Senior ML Platform Engineer - Lepton

    NVIDIA (Santa Clara, CA)
    …generative AI to autonomous vehicles. We are now looking for a ML Platform Engineer to help accelerate the next era of machine learning innovation. In this role, ... the world's most powerful GPU systems. Join our top team and apply your SRE and software engineering skills to craft robust, user-friendly platforms for seamless ML… more
    NVIDIA (11/04/25)
    - Related Jobs
  • Senior Staff Machine Learning…

    ServiceNow, Inc. (Santa Clara, CA)
    …and de-risk AI technologies that unlock new work experiences in the future. **As a Senior Staff Machine Learning Engineer you will:** + Contribute to the design, ... sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how...reliable. + Contribute to the continuous improvement of the SRE practice by turning operational use cases into requirements… more
    ServiceNow, Inc. (12/02/25)
    - Related Jobs
  • Site Reliability Engineer ( Senior

    MongoDB (San Francisco, CA)
    …the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer ( SRE ) with a strong networking background to join the Fabric ... **The Team** Platform Engineering is the department within SRE that is responsible for a range of...secure and efficient communication between our services. As an SRE on the Fabric team, you will leverage your… more
    MongoDB (10/07/25)
    - Related Jobs
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering ( SRE ) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external...while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of… more
    NVIDIA (11/05/25)
    - Related Jobs
  • Senior Staff Cloud Infrastructure…

    Zscaler (San Jose, CA)
    …and agility with a cloud-first strategy. We're looking for an experienced Senior Staff Cloud Infrastructure Engineer to join our Infrastructure Architecture ... CA office five days a week. Reporting to the Senior Director of Cloud Operations (Cloud Platform and Architecture),...+ 10+ years of experience in cloud engineering, DevOps, SRE , or infrastructure roles with a strong background in… more
    Zscaler (11/26/25)
    - Related Jobs
  • ( Senior ) Software Engineer

    pony.ai (Fremont, CA)
    …globally. Pony.ai went public at NASDAQ in November 2024. Responsibilities As a ( Senior ) Kubernetes Engineer , you will: + Design, operate, and optimize ... security policies, and operational guidelines. + Contribute to observability and SRE practices to ensure reliability at scale (SLOs, incident reviews, metrics-driven… more
    pony.ai (09/16/25)
    - Related Jobs
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …equivalent experience. + 10+ years operating large-scale production systems in roles such as SRE , Production Engineer , or Platform Engineer and 5+ years ... to model training clusters to real-time decision making. This isn't a typical SRE role, you'll help design and run NVIDIA's global telemetry backbone, the platform… more
    NVIDIA (12/06/25)
    - Related Jobs
  • Senior Software Engineer , AI…

    LinkedIn (Mountain View, CA)
    …to optimize their models and deliver the best performance possible. As a Senior Software Engineer , you will have first-hand opportunities to advance one ... billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra team, you will...work well in a diverse, team-focused environment with other SRE /SWE Engineers, Project Managers, etc. + Experience building ML… more
    LinkedIn (12/05/25)
    - Related Jobs
  • Senior , Software Engineer

    Walmart (Sunnyvale, CA)
    …high-performance checkout services running in Edge and Cloud. As a Site Reliability Engineer in the CPC Team, you will work with L2, Other dependent Applications, ... reusable tools, library, dashboards which can be used across DevOps/ SRE teams **What you'll bring:** + Bachelor's degree in...related discipline + 5+ years of hands-on related to SRE , Operations ; Development experience with Java Script, Java,… more
    Walmart (11/14/25)
    - Related Jobs
  • Senior AI Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    We are seeking a AI Infrastructure Engineer to integrate third-party infrastructure partners into NVIDIA's operational excellence programs. This cross-functional ... well as managing vendor relationships. You will partner with engineering, SRE , product, and third-party infrastructure providers to achieve operational excellence.… more
    NVIDIA (10/24/25)
    - Related Jobs