• Software Engineer, Reliability

    DoorDash (San Francisco, CA)
    …ship, observe, and remediate production systems. About the Role As a Software Engineer on Reliability Platforms, you'll help design and build the systems ... About the Team The Reliability Platforms organization is part of DoorDash's Production ... of how engineers safely change, observe, and operate production systems. Our mission is to enable teams to confidently…
    DoorDash (10/03/25)
  • Site Reliability Operations Engineer

    PennyMac (Westlake Village, CA)
    …quickly and accurately, is critical to the success of anyone in this role. The Engineer III, Site Reliability Operations will: + Monitoring - Oversee 24/7 health ... A Typical Day As a member of the Site Reliability Operations (SRO) team, you will help provide 24/7 ... timely and accurate resolution of service disruptions + Advanced Systems Administration - Perform and troubleshoot a wide range…
    PennyMac (08/07/25)
  • Staff Software Engineer, Site…

    Google (Mountain View, CA)
    Staff Software Engineer, Site Reliability Engineering, Google (San Francisco, CA, USA; Mountain View, CA, USA; +2 more; +1 more) ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to build and…
    Google (10/01/25)
  • Senior Software Engineer, Site…

    Google (Sunnyvale, CA)
    Senior Software Engineer, Site Reliability Engineering, Google (Durham, NC, USA; Raleigh, NC, USA; +3 more; +2 more) **Mid** Experience ... SRE ensures that Google's services, both our internally critical and our externally visible systems, have reliability and uptime appropriate to users' needs and a…
    Google (10/01/25)
  • Staff Reliability Engineer

    Celonis (Redwood City, CA)
    …engineering and Site Reliability Engineering (SRE) principles to drive system reliability, scalability, and operational excellence across the organization. ... Engineering with modern Software Engineering practices to build resilient and scalable systems. + Lead reliability efforts for a fleet of 80+ FedRAMP-compliant…
    Celonis (07/31/25)
  • Senior System Reliability

    Ford Motor Company (Long Beach, CA)
    …In this highly interdisciplinary role, you will work with multiple teams and help set reliability targets at the system and subsystem level. You will oversee the ... support an entire vehicle. What you'll do * Define reliability targets for different systems and subsystems ... Reliability requirement development, target allocation, cascading the reliability requirements from top level to system…
    Ford Motor Company (09/03/25)
  • Network Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …actualize the SRE vision for our network infrastructure. We are looking for an engineer who is passionate about the network and making its operation seamless with a ... be focused on user satisfaction and brilliance in Network Operations. This SRE engineer will focus on tackling significant projects and is committed to fostering a…
    NVIDIA (07/26/25)
  • Sr. Site Reliability Engineer

    Amazon (Culver City, CA)
    …and systems in AWS. The team will operationalize the stability and reliability of these systems and discover innovative ways to scale and operate ... improvements within existing frameworks, tools and processes to continuously improve systems. Site Reliability Engineers focus on automating infrastructure at…
    Amazon (09/09/25)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an ... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and…
    NVIDIA (10/02/25)
  • Senior Site Reliability Engineer

    Palo Alto Networks (Santa Clara, CA)
    …including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides, you ... to develop innovative solutions that provide clear and actionable insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the…
    Palo Alto Networks (10/03/25)