- Chick-fil-A (Atlanta, GA)
- Overview At Chick-fil-A, Reliability and Monitoring is a technical function which mixes in influence. Across our 3000+ North American Restaurants, cloud, and ... work with our DevOps teams to introduce and hone SRE principles, establish reliability goals, and develop tooling for operational observability. We are a small team… more
- Walmart (Sunnyvale, CA)
- **Position Summary ** **What you'll do ** As a Site Reliability Operations Engineer within the Global Technology Platforms (GTP) Command and Control Center ... processes that will ensure highest levels of availability and reliability of Walmart's technology stack. You're right for the...right for the job if you are comfortable in monitoring , detecting, major incident response with a technical team… more
- Chick-fil-A (Atlanta, GA)
- …are searching for a skilled and motivated engineer to lead Cloud Operations and Site Reliability Engineering (SRE) for our international Restaurants and to ... implementation, and management of solutions that enhance the efficiency and reliability of our security, network, and infrastructure. You will collaborate closely… more
- Mondelez International (Chicago, IL)
- …conferences, and obtaining relevant certifications. + **Performance Metrics and KPIs** : Reliability engineer is responsible for monitoring and reporting ... to minimize downtime and maximize productivity. + **Root Cause Analysis (RCA)** : Reliability engineer should be adept at conducting root cause analyses to… more
- TEKsystems (Santa Clara, CA)
- Description Summary NVIDIA is looking for a Network Reliability and Operations (NRO) Engineer to support and maintain our cloud network infrastructure. This ... network infrastructures, which include intra-DC, inter-DC, and CSP environments. Network Reliability Operations experience * Knowledge of large scale IP… more
- NVIDIA (Santa Clara, CA)
- …engineers to design, develop and implement a global, dynamic, innovative Service Reliability Operations Center, to provide extraordinary levels of support for ... you will partner with other key members of our organization including Site Reliability Engineering, Security Operations Center, DevOps teams, and other partners… more
- ServiceNow, Inc. (Orlando, FL)
- It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... highly technical engineers who are tasked with maintaining and supporting the reliability , scalability and performance of the automations and platform to manage the… more
- PennyMac (Westlake Village, CA)
- …through the complete mortgage journey. A Typical Day As a member of the Site Reliability Operations (SRO) team, you will help provide 24/7 monitoring and ... database infrastructure and related systems. This role focuses specifically on database operations , performance optimization, and ensuring the reliability of our… more
- The Walt Disney Company (Glendale, CA)
- …We deeply embed in engineering teams to continuously improve system performance and reliability . The Senior Systems Reliability Engineer is responsible for ... architecting resilient platforms, developing automation solutions for deployment and operations , implementing robust monitoring and alerting strategies, and… more
- Palo Alto Networks (Santa Clara, CA)
- …highly available. + Automate deployments, monitoring , and alerting to streamline operations and improve reliability . + Diagnose and resolve critical issues, ... is among the largest GCP customers. As a Site Reliability Engineer on the CDSS Advanced URL...automation using Python, Golang, or shell scripting to streamline operations and enhance system reliability . + Production… more
Related Job Searches:
Engineer,
Monitoring,
Monitoring Engineer,
Operations,
Operations Engineer,
Operations Reliability Engineer,
Reliability