  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
    NVIDIA (09/22/25)
  • Senior Site Reliability

    Coinbase (Charlotte, NC)
    …Q3 2023. *What you'll be doing (i.e., job duties):* * Improve observability, reliability and availability by defining and measuring key metrics * Build automation and ... service disruptions and automate incident response * Proactively find and analyze reliability problems across our business units and stack, then design and implement… more
    Coinbase (08/19/25)
  • Senior Site Reliability

    Centene Corporation (Madison, WI)
    …are focused on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and ... to determine the root cause of issues and develop solutions for improved reliability. + Troubleshoots and resolves more complex problems with systems and services… more
    Centene Corporation (09/18/25)
  • Senior Site Reliability

    MongoDB (New York, NY)
    …Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also work closely with other SWE and SRE ... spans the globe - including several cloud providers + Build for reliability, making services and infrastructure available, resilient, fault tolerant and self-healing… more
    MongoDB (08/28/25)
  • Senior Site Reliability

    Rubrik (Jackson, MS)
    …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
    Rubrik (08/20/25)
  • Senior Site Reliability

    LiveRamp (AR)
    …issues with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and ... Terraform scripts in support of the mission in close collaboration with DevOps team** + **Maintain and enhance Engineering Operational Documentation for supported products.** + **Provide expertise to build and maintain products operational documentation and… more
    LiveRamp (08/07/25)
  • Senior Lead Site Reliability

    JPMorgan Chase (Plano, TX)
    …of technology at a globally recognized firm, driven by pride in ownership. As a **Senior Lead Site Reliability Engineering** at JPMorgan Chase within the ... situations with composure and tact. **Job responsibilities** + Demonstrates expertise in site reliability principles and demonstrates an understanding of the… more
    JPMorgan Chase (09/14/25)
  • Senior Staff Site Reliability

    Palo Alto Networks (Santa Clara, CA)
    …actionable insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: ... influence the operability of the product and ensure the reliability and availability of our services **Your Experience** +...DevOps/SRE Expertise: 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong… more
    Palo Alto Networks (07/15/25)
  • Sr. Software Reliability Engineer

    Abbott (Pleasanton, CA)
    …working mothers, female executives, and scientists. **The Opportunity** We're looking for a strong **Senior Site Reliability Engineer (SRE)** who's ready ... and compliant with healthcare regulations-this is the role for you. As a Senior SRE, you'll work closely with engineering, QA, cybersecurity, and regulatory teams to… more
    Abbott (09/20/25)
  • Senior Software Engineer

    Google (Pittsburgh, PA)
    …some of our SREs. + Read a career profile (https://careers.google.com/stories/site-reliability-engineering-profile-google/) about why a software engineer ... efficient large-scale systems is a true strategy, and a good one._ Site Reliability Engineering (SRE) is an engineering discipline that combines software and… more
    Google (09/27/25)