- Google (Mountain View, CA)
- …SRE ensures that Google's services, both our internally critical and our externally-visible systems, have reliability and uptime appropriate to users' needs and a ... + Read a career profile about why a software engineer chose to join SRE. Behind everything our users...systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability…
- ServiceNow, Inc. (San Diego, CA)
- It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... of the Fortune 500®. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and…
- Palo Alto Networks (Santa Clara, CA)
- …insights into our systems' performance and health. **Your Impact** As a Senior SRE with the Cortex Cloud Security Posture Management team, you will: + Cloud ... including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides, you…
- General Motors (Mountain View, CA)
- …our customers, including fleet management, energy optimization, transportation logistics, safety systems, and more. To fulfill our mission, we are actively expanding ... future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements of the infrastructure health and reliability monitoring for GM's commercial fleet. We are an…
- ServiceNow, Inc. (Santa Clara, CA)
- …unlock new work experiences in the future. **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute to the ... It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today -…
- Palo Alto Networks (Santa Clara, CA)
- …insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: Utilize ... including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides, you…
- NVIDIA (Santa Clara, CA)
- …health + Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity + ... Site Reliability Engineering (SRE) at NVIDIA is an engineering...discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination…
- NVIDIA (Santa Clara, CA)
- …health. + Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity + ... Site Reliability Engineering (SRE) at NVIDIA is an engineering...discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination…
- Coinbase (Sacramento, CA)
- …improvements. * Educate, mentor and hold accountable the engineering team to improve the reliability of our systems and make reliability a core value ... you'll be doing (i.e., job duties):* * Improve observability, reliability and availability by defining and measuring key metrics * Build automation and improve systems to eliminate toil and operations work. * Collaborate…
- LinkedIn (Mountain View, CA)
- …and troubleshooting production systems at scale. Suggested Skills: Distributed Systems · Technical Leadership · Infrastructure Reliability · Systems ... passion for distributed technologies and algorithms, API design and systems design, and your passion for writing code that...impact within our company. As a Sr. Staff Software Engineer, you will be a key technical leader and…