  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large-scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and ... be doing: + Design, implement and support operational and reliability aspects of large-scale Kubernetes clusters with focus…
    NVIDIA (11/05/25)
  • Senior Site Reliability

    Centene Corporation (Austin, TX)
    …are focused on managing and maintaining optimum platform infrastructure performance, reliability, and security using **SRE practices**, observability tools, manual ... to determine the root cause of issues and develop solutions for improved reliability. + Troubleshoots and resolves more complex problems with systems and services…
    Centene Corporation (12/14/25)
  • Sr Site Reliability

    SitusAMC (Columbus, OH)
    …inventory accounts + Ability to administrate source code repositories + Work with Sr. Engineers on scoping out proposed Projects + Become member of Incident response ... bonus as determined by bonus program guidelines, position eligibility and SitusAMC Senior Management approval. SitusAMC offers PTO and paid holidays, the terms of…
    SitusAMC (12/03/25)
  • Site Reliability Engineer

    MongoDB (New York, NY)
    …Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also work closely with other SWE and SRE ... spans the globe - including several cloud providers + Build for reliability, making services and infrastructure available, resilient, fault tolerant and self-healing…
    MongoDB (11/26/25)
  • Sr. Site Reliability

    MetLife (Cary, NC)
    …The Opportunity As the SHIELD Architect, you will serve as the senior technical authority for the enterprise's experience protection layer. You will define ... and ServiceNow or equivalent tooling. * Strong diagnostic, analytical, and pattern-recognition capabilities. * Ability to operate calmly under pressure in critical…
    MetLife (12/06/25)
  • Senior Site Reliability

    RELX INC (Columbia, SC)
    Are you a collaborative Azure Sr SRE looking to work for a mission driven global organization? Do you possess advanced Azure SRE skills and looking to put those ... compliance, reusability, and security. + Individuals are responsible for challenging reliability and toil reduction projects. At this level, SREs have hands-on…
    RELX INC (10/18/25)
  • Principal Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... efficiency of services and drive efficiency with software and hardware optimizations (SR-IOV/DPU) + Experience with Technologies like eBPF and XDP for Observability…
    NVIDIA (11/20/25)
  • Site Reliability Engineer

    MongoDB (Chicago, IL)
    We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure. As a Staff SRE, ... with a strong focus on security work, with ideally 2+ years in a senior or staff engineering role Security Mindset: + A comprehensive understanding of all facets…
    MongoDB (10/29/25)
  • Senior Staff Site Reliability

    Zscaler (San Jose, CA)
    …a cloud-first strategy. We're seeking a highly skilled and experienced SRE Platform Engineer to join our SRE Cloud Platform Engineering Team. Reporting to the ... Director of Cloud Engineering, you will be responsible for: + Designing and maintaining scalable infrastructure solutions to support Zscaler's global cloud services + Enhancing observability practices across infrastructure and applications through monitoring,…
    Zscaler (12/08/25)
  • Junior Site Reliability

    Insight Global (Santa Clara, CA)
    …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew ... that develops and maintains sophisticated internal cloud provisioning products. The team works with various other business units such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their…
    Insight Global (12/07/25)