• Senior System Software Engineer , AI…

    NVIDIA (Santa Clara, CA)
    …We are excited to have a fun-loving person like you join our team! As a Senior Software Engineer , you will be responsible for building industry insights into AI ... NVIDIA is hiring senior system software engineers in its Infrastructure, Planning...engineers across software development, DevOps, and Site Reliability Engineering ( SRE ) activities. What you'll be doing: + Responsible for… more
    NVIDIA (10/14/25)
    - Related Jobs
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering ( SRE ) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external...while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of… more
    NVIDIA (10/15/25)
    - Related Jobs
  • Senior Customer Reliability Engineer

    Google (Sunnyvale, CA)
    Senior Customer Reliability Engineer , Reliability Incident Management _corporate_fare_ Google _place_ New York, NY, USA; Austin, TX, USA; +2 more; +1 more ... Customer Engineering or professional services. + Experience in applying SRE principles to improve the reliability and performance of...to connect with customers, employees and partners. As a Senior Customer Reliability Engineer , you will be… more
    Google (10/17/25)
    - Related Jobs
  • Senior Staff Network Automation…

    ServiceNow, Inc. (San Diego, CA)
    It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... keep reliable, we don't throw people at the problem-we ** engineer it away** with software. You'll join Network Reliability...is expanding in scale and complexity. We need a senior leader / builder who can **own design through… more
    ServiceNow, Inc. (10/03/25)
    - Related Jobs
  • Senior Staff Machine Learning…

    ServiceNow, Inc. (Santa Clara, CA)
    …and de-risk AI technologies that unlock new work experiences in the future. **As a Senior Staff Machine Learning Engineer you will:** + Contribute to the design, ... sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how...reliable. + Contribute to the continuous improvement of the SRE practice by turning operational use cases into requirements… more
    ServiceNow, Inc. (09/27/25)
    - Related Jobs
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... reviews, assist in root cause identification, and write RCA reports. + Deliver SRE solutions in a globally distributed, multi-cloud hybrid environment - AWS, GCP,… more
    NVIDIA (09/17/25)
    - Related Jobs
  • ( Senior ) Software Engineer

    pony.ai (Fremont, CA)
    …globally. Pony.ai went public at NASDAQ in November 2024. Responsibilities As a ( Senior ) Kubernetes Engineer , you will: + Design, operate, and optimize ... security policies, and operational guidelines. + Contribute to observability and SRE practices to ensure reliability at scale (SLOs, incident reviews, metrics-driven… more
    pony.ai (09/16/25)
    - Related Jobs
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior Site Reliability Engineer to work in IPP (Infrastructure, Planning and Process). IPP is a global organization within NVIDIA. This ... supporting the latest Nvidia hardware and technologies + Develop SRE agents that will help streamline daily Cost of...best practices, and designing scalable, resilient systems based on SRE principles + Ability to debug and analyze source… more
    NVIDIA (09/11/25)
    - Related Jobs
  • Senior , Software Engineer

    Walmart (Sunnyvale, CA)
    …high-performance checkout services running in Edge and Cloud. As a Site Reliability Engineer in the CPC Team, you will work with L2, Other dependent Applications, ... reusable tools, library, dashboards which can be used across DevOps/ SRE teams **What you'll bring:** + Bachelor's degree in...related discipline + 5+ years of hands-on related to SRE , Operations ; Development experience with Java Script, Java,… more
    Walmart (08/15/25)
    - Related Jobs
  • Senior Software Engineer , AI…

    LinkedIn (Mountain View, CA)
    …to optimize their models and deliver the best performance possible. As a Senior Software Engineer , you will have first-hand opportunities to advance one ... billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra team, you will...work well in a diverse, team-focused environment with other SRE /SWE Engineers, Project Managers, etc. + Experience building ML… more
    LinkedIn (10/18/25)
    - Related Jobs