- Walmart (Sunnyvale, CA)
- …Define SLAs, SLOs, and error-budget policies at the platform level; partner with Site Reliability Engineering to implement chaos experiments, canary ... + **Operational Excellence:** Expertise in defining platform-wide SLAs/SLOs, automating reliability frameworks (chaos engineering , self-healing), and leading… more
- Cardinal Health (Sacramento, CA)
- …microservices, public cloud alongside some more traditional distributed systems and databases. The Site Reliability Engineering (SRE) Team is an integrated ... and positive user experiences at every interaction. As a Site Reliability Engineer at Sonexus, you'll be...engineering , dev, and infrastructure teams to solve complex reliability challenges using automation and observability + Maintain and… more
- Amazon (Cupertino, CA)
- …and kernel drivers. - 5+ years or more in software development, systems development, SRE ( Site Reliability Engineering ), or Resilience Engineering - 5+ ... you to own them to completion. The AWS Hardware Engineering (HWEng) team creates server designs for Amazon's innovative...- 2+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience… more
- Palo Alto Networks (Santa Clara, CA)
- …and resolve production incidents **Your Experience** + 4+ years of experience in DevOps, Site Reliability Engineering , or Cloud Infrastructure roles + Strong ... that powers our large-scale cloud platform. You will work closely with engineering teams to enable fast and reliable software delivery, optimize system performance,… more
- Cisco (San Jose, CA)
- …spear in interacting with our customers. Our CRE team adapts the best practices of Site Reliability Engineering (SRE) and applies them to our customers. As ... at a large production scale. + Extensive knowledge of Customer Reliability Engineering (CRE) practices, including Production Readiness Reviews (PRRs),… more
- Red Hat (Sacramento, CA)
- …in a role like Software Engineering , Performance Engineering , or Site Reliability Engineering (SRE). + Significant hands-on experience deploying and ... The Red Hat Performance and Scale Engineering team is looking for an experienced Senior...both internally and externally, and provide continuous feedback to Engineering teams and the leadership **Required Skills:** + Minimum… more
- Ford Motor Company (Long Beach, CA)
- …Strong working in CI/CD environments + Experience with software operations (DevOps, Site Reliability Engineering , Observability, Support and Maintenance) ... and providing feedback on product designs and architectures with a software engineering focus. + Evaluate and recommend new and emerging products and technologies.… more
- The Walt Disney Company (Glendale, CA)
- …+ 10+ years of experience across Infrastructure, DevOps, Software Engineering , or Site Reliability Engineering in large-scale cloud environments. + Deep ... innovation across the cloud platforms that support both data and software engineering teams-designing systems that are secure, scalable, and built to accelerate… more
- ServiceNow, Inc. (San Diego, CA)
- …OS, applications, databases, networks, web and application servers. Prior experience in Site Reliability Engineering /DevOps and managing large-scale server ... Team** As a key member of the Systems Administration team within Operations Engineering , you will be responsible for the administration and operations of the global… more
- SpaceX (Hawthorne, CA)
- …engineering role in lieu of a degree. + 5+ years in IT operations, site ‑ reliability , or infrastructure engineering . + 3+ years administering or developing ... incidents, mentor peers, and drive data‑driven improvements that raise service reliability across the entire IT organization. You will contribute to building… more