- ServiceNow, Inc. (Santa Clara, CA)
- …experiences in the future. **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute to the design, development ... sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how...and implementation of infrastructure, platform, deployment and observability features that power AI workloads. + Collaborate with… more
- Celonis (Redwood City, CA)
- …and resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability, ... join us. **The Team** As a member of our Reliability Engineering team, you will play a critical role...(AWS, Azure, or GCP) and modern cloud monitoring system observability frameworks (e.g., Datadog). + Working knowledge developing and… more
- Coinbase (Sacramento, CA)
- …is expected and fully supported. Coinbase is hiring! We are looking for an experienced Site Reliability Engineer (SRE) to join the IT Operations Corporate ... cause analysis, and blameless retrospectives * Define metrics and bolster monitoring/observability across corporate IAM systems * Participate in regular on-call… more
- MongoDB (San Francisco, CA)
- …or remotely in the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to ... these are our multi-cloud-provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems. The Fabric team manages the… more
- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and ... GCP, Azure. + Demonstrated proficiency with end-to-end SRE capabilities and observability. + Proficient in monitoring, metrics gathering, APM, container management,… more
- MongoDB (San Francisco, CA)
- …office, we provide hybrid work accommodation. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) Lead with a strong networking ... these are our multi-cloud-provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems. The Fabric team manages the… more
- Coinbase (Sacramento, CA)
- …wide system's reliability and less customer impact. As a *Senior Software Engineer* you will help to promote reliability culture across Coinbase. You would ... on a daily basis. *What you'll be doing (i.e., job duties):* * Improve observability, reliability and availability by defining and measuring key metrics * Build… more
- Verint Systems, Inc. (Sacramento, CA)
- …opportunities. Learn more at www.verint.com. **Overview of Job Function:** Verint's Sr. Reliability Engineer is responsible for all aspects of the development ... platforms and applications. In this highly skilled, hands-on role, our Sr. Reliability Engineer ensures the scalability, availability, performance, and… more
- Rubrik (Palo Alto, CA)
- …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
- Celonis (Redwood City, CA)
- …and resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability, ... + Join a highly technical, collaborative, and innovation-driven team that blends Site Reliability Engineering with modern Software Engineering practices to build… more