• Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …once they are live by measuring and monitoring availability, latency and overall system health + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
    NVIDIA (08/02/25)
    - Related Jobs
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
    NVIDIA (08/01/25)
    - Related Jobs
  • Senior Systems Reliability

    The Walt Disney Company (Sacramento, CA)
    …knowledge in system management languages (eg Terraform, Ansible) + Operating systems and systems management (eg Amazon Linux, Windows) + **Multiple scripting ... of the team that provides cutting edge film making systems in the public cloud, focused on automation and...availability, and clear observability + Maintain and improve the reliability of services and infrastructure + Troubleshoot and resolve… more
    The Walt Disney Company (08/08/25)
    - Related Jobs
  • Staff Site Reliability Engineer

    ServiceNow, Inc. (San Diego, CA)
    It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... of the Fortune 500(R). Our intelligent cloud-based platform seamlessly connects people, systems , and processes to empower organizations to find smarter, faster, and… more
    ServiceNow, Inc. (07/15/25)
    - Related Jobs
  • Senior Site Reliability

    Rubrik (Palo Alto, CA)
    … and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance ... enable teams at Rubrik to develop secure software and protect data and systems with appropriate security controls. Information Security also develops systems to… more
    Rubrik (08/07/25)
    - Related Jobs
  • Senior Site Reliability

    LiveRamp (San Francisco, CA)
    …issues with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and ... Dynamodb** + **Optimize the performance and cost of the systems and rightsize Kubernetes containers.** + **Work in close...code, and automate routine tasks** + **Experience with securing systems in a public cloud environment** + **Understands how… more
    LiveRamp (08/07/25)
    - Related Jobs
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew ... and Driverless Cars to cater to their infrastructure & systems needs. As an SRE, youll also be working...Science, Information Technology, or related field, or equivalent experience. - System admin and Windows admin experience in an on… more
    Insight Global (08/01/25)
    - Related Jobs
  • Staff Site Reliability Engineer

    MongoDB (San Francisco, CA)
    …to build next-generation, AI-powered applications. We are looking for an experienced Staff Engineer for our SRE, InfraSec team, to guide the security of our ... on security work, with ideally 2+ years in a senior or staff engineering role Security Mindset: + A...low-level fundamentals, and how they work together in complex systems Communication and Leadership Skills: + Strong ability to… more
    MongoDB (08/08/25)
    - Related Jobs
  • Senior Manager, Network Site…

    NVIDIA (Santa Clara, CA)
    GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader ... be doing: + Cultivate a top-performing team of Network Site Reliability Engineers through encouraging a culture of collaboration, accountability, and technical… more
    NVIDIA (08/08/25)
    - Related Jobs
  • Senior Site Reliability SRE…

    Rubrik (Palo Alto, CA)
    …we want to talk to you! **About The Role:** Sr . Site Reliability Engineers at Rubrik are systems /software engineers who ensure that Rubrik's infrastructure ... our customers + Design, implement and maintain relational database systems for performance and reliability + Manage...years of experience as a Development, DevOps or Site Reliability Engineer Willing to provide 24/7 coverage… more
    Rubrik (08/07/25)
    - Related Jobs