- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large-scale production systems with high efficiency and ... open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external ... while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of…
- Celonis (Redwood City, CA)
- …The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability, scalability, and operational excellence ... for a fleet of 80+ FedRAMP-compliant microservices running on Kubernetes, applying SRE principles to drive observability, automation, and incident prevention. + Own…
- House of Blues (CA)
- Job Summary: Job Description - Senior AI-Driven Platform Automation Engineer. Location: Remote, US. Division: Ticketmaster US. Line Manager: Director, Software ... at scale. The Job: We are looking for a Senior AI-Driven Platform Automation Engineer to join our high-impact Platform Automation and SRE group within the Core Concerts division at Ticketmaster.…
- Palo Alto Networks (Santa Clara, CA)
- …and actionable insights into our systems' performance and health. **Your Impact** As a Senior SRE with the Cortex Cloud Security Posture Management team, you ... incident and alerts management in Site Reliability Engineering + DevOps/SRE Expertise - 4+ years of experience as a DevOps/SRE engineer with a passion for technology and a…
- Palo Alto Networks (Santa Clara, CA)
- …and actionable insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud ... reliability and availability of our services **Your Experience** + DevOps/SRE Expertise: 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a…
- General Motors (Mountain View, CA)
- …where we live and deliver a better future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements of the infrastructure ... innovate! **What You'll Do** + Implement scalable, reliable, secure SRE and Observability platform to monitor health of our ... plan to help you save for retirement; * Global recognition program for peers and leaders to recognize and…
- Cisco (San Jose, CA)
- Splunk is looking for a Senior Manager to provide day-to-day leadership in our Splunk Cloud TechOps FedRAMP team. This position is responsible for overseeing the ... delivery of Splunk's SaaS customer-facing systems. As a Senior Manager of TechOps, you'll lead a team responsible ... the business + Partner with our NOC, Support, and SRE teams to deliver agile, highly automated capabilities to…
- Celonis (Los Angeles, CA)
- …bar for availability, security, and quality. We partner closely with infrastructure, SRE, testing, and application teams to create a seamless, scalable, and ... developer experience. **The Role:** **Why This Role Matters:** As a Senior Software Engineer (Release Infrastructure), you'll take ownership of core systems…
- ServiceNow, Inc. (Santa Clara, CA)
- …AI technologies that unlock new work experiences in the future. **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute ... well, and remain reliable. + Contribute to the continuous improvement of the SRE practice by turning operational use cases into requirements for software tooling. + …
- Walmart (Sunnyvale, CA)
- …+ Build reusable tools, libraries, and dashboards which can be used across DevOps/SRE teams **What you'll bring:** + Bachelor's degree in Computer Science, Engineering ... or related discipline + 5+ years of hands-on related to SRE, Operations; Development experience with JavaScript, Java, RESTful services, Git, Maven, Jenkins,…