• (USA) Principal, Software Engineer

    Walmart (Sunnyvale, CA)
    …Define SLAs, SLOs, and error-budget policies at the platform level; partner with Site Reliability Engineering to implement chaos experiments, canary ... + **Operational Excellence:** Expertise in defining platform-wide SLAs/SLOs, automating reliability frameworks (chaos engineering , self-healing), and leading… more
    Walmart (05/16/25)
    - Related Jobs
  • SRE Engineer

    Cardinal Health (Sacramento, CA)
    …microservices, public cloud alongside some more traditional distributed systems and databases. The Site Reliability Engineering (SRE) Team is an integrated ... and positive user experiences at every interaction. As a Site Reliability Engineer at Sonexus, you'll be...engineering , dev, and infrastructure teams to solve complex reliability challenges using automation and observability + Maintain and… more
    Cardinal Health (08/08/25)
    - Related Jobs
  • Sr. System Development Engineer, Solid State…

    Amazon (Cupertino, CA)
    …and kernel drivers. - 5+ years or more in software development, systems development, SRE ( Site Reliability Engineering ), or Resilience Engineering - 5+ ... you to own them to completion. The AWS Hardware Engineering (HWEng) team creates server designs for Amazon's innovative...- 2+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience… more
    Amazon (06/11/25)
    - Related Jobs
  • Sr DevOps Engineer (Cortex)

    Palo Alto Networks (Santa Clara, CA)
    …and resolve production incidents **Your Experience** + 4+ years of experience in DevOps, Site Reliability Engineering , or Cloud Infrastructure roles + Strong ... that powers our large-scale cloud platform. You will work closely with engineering teams to enable fast and reliable software delivery, optimize system performance,… more
    Palo Alto Networks (07/24/25)
    - Related Jobs
  • Technical Consulting Engineer

    Cisco (San Jose, CA)
    …spear in interacting with our customers. Our CRE team adapts the best practices of Site Reliability Engineering (SRE) and applies them to our customers. As ... at a large production scale. + Extensive knowledge of Customer Reliability Engineering (CRE) practices, including Production Readiness Reviews (PRRs),… more
    Cisco (08/08/25)
    - Related Jobs
  • Senior Software Engineer

    Red Hat (Sacramento, CA)
    …in a role like Software Engineering , Performance Engineering , or Site Reliability Engineering (SRE). + Significant hands-on experience deploying and ... The Red Hat Performance and Scale Engineering team is looking for an experienced Senior...both internally and externally, and provide continuous feedback to Engineering teams and the leadership **Required Skills:** + Minimum… more
    Red Hat (08/08/25)
    - Related Jobs
  • Senior Engineer, iOS

    Ford Motor Company (Long Beach, CA)
    …Strong working in CI/CD environments + Experience with software operations (DevOps, Site Reliability Engineering , Observability, Support and Maintenance) ... and providing feedback on product designs and architectures with a software engineering focus. + Evaluate and recommend new and emerging products and technologies.… more
    Ford Motor Company (07/30/25)
    - Related Jobs
  • Principal Platform Engineer, Infrastructure…

    The Walt Disney Company (Glendale, CA)
    …+ 10+ years of experience across Infrastructure, DevOps, Software Engineering , or Site Reliability Engineering in large-scale cloud environments. + Deep ... innovation across the cloud platforms that support both data and software engineering teams-designing systems that are secure, scalable, and built to accelerate… more
    The Walt Disney Company (07/17/25)
    - Related Jobs
  • Senior Linux System Admin - Federal

    ServiceNow, Inc. (San Diego, CA)
    …OS, applications, databases, networks, web and application servers. Prior experience in Site Reliability Engineering /DevOps and managing large-scale server ... Team** As a key member of the Systems Administration team within Operations Engineering , you will be responsible for the administration and operations of the global… more
    ServiceNow, Inc. (07/30/25)
    - Related Jobs
  • Sr. IT Operations Engineer

    SpaceX (Hawthorne, CA)
    engineering role in lieu of a degree. + 5+ years in IT operations, sitereliability , or infrastructure engineering . + 3+ years administering or developing ... incidents, mentor peers, and drive data‑driven improvements that raise service reliability across the entire IT organization. You will contribute to building… more
    SpaceX (08/02/25)
    - Related Jobs