- Amazon (Cupertino, CA)
- …designs cutting AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced ... * You will have a fundamental understanding of Reliability statistics/ Reliability tests and/or solid understanding of computer systems to influence… more
- Google (Sunnyvale, CA)
- Senior Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes and ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to build and… more
- Ford Motor Company (Long Beach, CA)
- …In this highly interdisciplinary role, you will work with multiple teams and help set reliability targets at the system and subsystem level. You will oversee the ... support an entire vehicle. What you'll do * Define reliability targets for different systems and subsystems... Reliability requirement development, target allocation, cascading the reliability requirements from top level to system … more
- Palo Alto Networks (Santa Clara, CA)
- …insights into our systems ' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: Utilize ... including the design, implementation, and continuous enhancement of our comprehensive observability systems . To meet the opportunities that such a role provides, you… more
- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- Coinbase (Sacramento, CA)
- …improvements. * Educate, mentor and hold accountable the engineering team to improve the reliability of our systems and make reliability a core value ... platform - and with it, the future global financial system . To achieve our mission, we're seeking a very...you'll be doing (ie. job duties):* * Improve observability, reliability and availability by defining and measuring key metrics… more
- Rubrik (Sacramento, CA)
- … and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance ... enable teams at Rubrik to develop secure software and protect data and systems with appropriate security controls. Information Security also develops systems to… more
- LiveRamp (San Francisco, CA)
- …issues with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and ... Dynamodb** + **Optimize the performance and cost of the systems and rightsize Kubernetes containers.** + **Work in close...code, and automate routine tasks** + **Experience with securing systems in a public cloud environment** + **Understands how… more
- Northrop Grumman (San Diego, CA)
- … programs; and key participants in the systems engineering organization. The Senior Principal Reliability Engineer will interface between the ... our employees have incredible opportunities to work on revolutionary systems that impact people's lives around the world today,...and Modernization (N-OSSM) Operating Unit is looking for a Reliability Engineer to join our team based… more
- Abbott (Pleasanton, CA)
- …mothers, female executives, and scientists. **The Opportunity** We're looking for a strong ** Senior Site Reliability Engineer (SRE)** who's ready to roll ... in reliability and observability **What You'll Work On** + ** System Reliability & Performance** : Design and maintain fault-tolerant infrastructure… more