- NVIDIA (Santa Clara, CA)
- …impact on the world. We are seeking a highly skilled and experienced Network Site Reliability Engineer (SRE) to join our Enterprise Network Operations ... practice in network operations or related fields concentrating on automation & site reliability engineering. Familiarity with both enterprise and the data… more
- Palo Alto Networks (Santa Clara, CA)
- …Networks runs a large infrastructure and is one of the largest GCP customers. As a Principle Site Reliability Engineer for the TDP team, you will be part of ... running on this infrastructure. This includes automation, architecture, performance, observability , troubleshooting, security, and reliability . Our Infrastructure… more
- RELX INC (Sacramento, CA)
- …with modern multi cloud platforms and cutting-edge tools to enhance system reliability , visibility, and security across the entire development lifecycle. If you are ... here. About the Role: This position individuals are responsible for challenging reliability and toil reduction projects. Key Responsibilities: + Monitoring & … more
- LinkedIn (Mountain View, CA)
- …equivalent role at a high-growth or web-scale technology company Suggested Skills + Site Reliability Engineering (SRE) + Leadership + Large scale infrastructure ... + Serve as a senior technical leader driving the long-term reliability and observability strategy across LinkedIn's infrastructure + Re-architect… more
- Intuit (San Diego, CA)
- **Overview** Come join the Identity Team as Site Reliability / DevOps Engineer (System Engineering). Identity is at the heart of all offerings across Intuit ... platforms to enable faster and automatic recovery. + Design and develop observability components for massive scale platforms, to detect issues quickly and isolate… more
- MongoDB (San Francisco, CA)
- …or remotely in the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to ... these are our multi-cloud-provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems. The Fabric team manages the… more
- NVIDIA (Santa Clara, CA)
- …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... optimizations (SR-IOV/ DPU) + Experience with Technologies like eBPF and XDP for Observability & DDoS mitigation + Collect and review system data for capacity and… more
- Palo Alto Networks (Santa Clara, CA)
- …are robust and performant. This includes automation, architecture, performance, observability , troubleshooting, security, and reliability . Our Infrastructure ... Platform stack includes Terraform, Kubernetes, GitLab CI/CD, GitOps, Prometheus, Grafana, Loki, Docker, GCP, Backstage, MySQL, PagerDuty, FireHydrant, Python, Bash, Java, NodeJS and Go. **Your Impact** + **Design, build, and operate** reliable, secure Cloud… more
- Abbott (Pleasanton, CA)
- …mothers, female executives, and scientists. **The Opportunity** We're looking for a strong **Senior Site Reliability Engineer (SRE)** who's ready to roll up ... , helping monitor systems, respond to incidents, and drive continuous improvements in reliability and observability **What You'll Work On** + **System … more
- The Walt Disney Company (Glendale, CA)
- …We deeply embed in engineering teams to continuously improve system performance and reliability . The Senior Systems Reliability Engineer is responsible for ... + Identify and automate manual operational processes ("toil") to improve system reliability and engineer productivity. + 24x7 on-call operational support. **Must… more