- Google (Mountain View, CA)
- …Air to meet some of our SREs. + Read a career profile about why a software engineer chose to join SRE. Behind everything our users see online is the architecture ... systems is a true strategy, and a good one. Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build…
- LiveRamp (San Francisco, CA)
- …to build and maintain products operational documentation and setting up product SRE practices + Experience working with real-time and NoSQL databases such as ... and rightsize Kubernetes containers. + Work in close collaboration with SRE team members and Engineering organizations based in California, Paris, Nantong,…
- Celonis (Redwood City, CA)
- …The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability, scalability, and operational excellence ... for a fleet of 80+ FedRAMP-compliant microservices running on Kubernetes, applying SRE principles to drive observability, automation, and incident prevention. + Own…
- Palo Alto Networks (Santa Clara, CA)
- …and actionable insights into our systems' performance and health. **Your Impact** As a Senior SRE with the Cortex Cloud Security Posture Management team, you ... + DevOps/SRE Expertise - 4+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong motivation for high reliability…
- Palo Alto Networks (Santa Clara, CA)
- …**Your Experience** + DevOps/SRE Expertise: 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong motivation for ... insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: Utilize…
- NVIDIA (Santa Clara, CA)
- We are seeking a motivated cloud platform Senior Systems Engineer to join our team in building and scaling our cloud-native infrastructure which enables ... experience managing large-scale production clusters + Good understanding of the SRE best practices, alerting and observability + Advanced Kubernetes workload…
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external ... while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of…
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external ... while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of…
- Palo Alto Networks (Santa Clara, CA)
- …and application development. We are looking for a Sr. Data Platform Engineer with extensive experience in data engineering, cloud infrastructure, and a strong ... background in DevOps, SRE, or system engineering. The ideal candidate will be ... for our data platforms (e.g., Airflow, Spark clusters), applying SRE and DevOps best practices for performance, reliability, and…
- General Motors (Mountain View, CA)
- …communities where we live and deliver a better future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements of the ... us and let's innovate! **What You'll Do** + Implement scalable, reliable, secure SRE and Observability platform to monitor health of our production system and…