• Senior Staff Machine Learning Engineer

    ServiceNow, Inc. (Santa Clara, CA)
    …experiences in the future. **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute to the design, development ... sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how...and implementation of infrastructure, platform, deployment and observability features that power AI workloads. + Collaborate with… more
    ServiceNow, Inc. (08/08/25)
    - Related Jobs
  • Site Reliability Engineer

    Celonis (Redwood City, CA)
    …and resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability , ... join us. **The Team** As a member of our Reliability Engineering team, you will play a critical role...(AWS, Azure, or GCP) and modern cloud monitoring system observability frameworks (eg, Datadog). + Working knowledge developing and… more
    Celonis (07/18/25)
    - Related Jobs
  • Senior Site Reliability

    Coinbase (Sacramento, CA)
    …is expected and fully supported. Coinbase is hiring! We are looking for an experienced Site Reliability Engineer (SRE) to join the IT Operations Corporate ... cause analysis, and blameless retrospectives * Define metrics and bolster monitoring/ observability across corporate IAM systems * Participate in regular on-call… more
    Coinbase (08/09/25)
    - Related Jobs
  • Staff Site Reliability

    MongoDB (San Francisco, CA)
    …or remotely in the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to ... these are our multi-cloud-provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems. The Fabric team manages the… more
    MongoDB (07/08/25)
    - Related Jobs
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... GCP, Azure. + Demonstrated proficiency with end-to-end SRE capabilities and observability . + Proficient in monitoring, metrics gathering, APM, container management,… more
    NVIDIA (07/01/25)
    - Related Jobs
  • Lead, Site Reliability

    MongoDB (San Francisco, CA)
    …office, we provide hybrid work accommodation. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) Lead with a strong networking ... these are our multi-cloud-provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems. The Fabric team manages the… more
    MongoDB (06/17/25)
    - Related Jobs
  • Senior Software Engineer , Infrastructure…

    Coinbase (Sacramento, CA)
    …wide system's reliability and less customer impact . As a *Senior Software Engineer * you will help to promote reliability culture across Coinbase. You would ... on a daily basis. *What you'll be doing (ie. job duties):* * Improve observability , reliability and availability by defining and measuring key metrics * Build… more
    Coinbase (08/09/25)
    - Related Jobs
  • Sr. Reliability Engineer

    Verint Systems, Inc. (Sacramento, CA)
    …opportunities. Learn more at www.verint.com . **Overview of Job Function:** Verint's Sr. Reliability Engineer is responsible for all aspects of the development ... platforms and applications. In this highly skilled, hands-on role, our Sr. Reliability Engineer ensures the scalability, availability, performance, and … more
    Verint Systems, Inc. (06/17/25)
    - Related Jobs
  • Senior Site Reliability

    Rubrik (Palo Alto, CA)
    …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
    Rubrik (08/07/25)
    - Related Jobs
  • Staff Reliability Engineer

    Celonis (Redwood City, CA)
    …and resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability , ... + Join a highly technical, collaborative, and innovation-driven team that blends Site Reliability Engineering with modern Software Engineering practices to build… more
    Celonis (07/31/25)
    - Related Jobs