- Walmart (Sunnyvale, CA)
- …from fraudulent activity across our global transaction platforms. Our DevOps and SRE engineers ensure these systems are reliable, scalable, and secure, enabling ... detection and prevention platforms. + Lead the design and rollout of SRE tooling, including monitoring, alerting, and incident response automation for high-risk… more
- S&P Global (New York, NY)
- …**Cyber and Security Engineering, Quality Engineering (QE), and System Reliability Engineering ( SRE )** . Your core mandate is to eliminate single points of failure, ... Lead the successful consolidation and harmonization of Cyber, QE, and SRE teams, creating a single, cohesive function. Establish **unified operational metrics**… more
- NetApp (Morrisville, NC)
- …infrastructure. You will leverage your expertise in NetApp products, ONTAP knowledge, and SRE automation and security to ensure the robustness and efficiency of our ... + **Infrastructure Understanding:** Analyze and understand both on-premises and cloud infrastructure requirements, ensuring robust and scalable solutions. + **… more
- HCA Healthcare (Nashville, TN)
- …unlock possibilities, and care like family. As a Senior Site Reliability Engineer ( SRE ), you will provide SRE best practices for mission-critical applications ... Skills, Abilities, Behaviors:** + Knowledge of infrastructure, frameworks, and software/ cloud design patterns for implementing applications in the cloud… more
- MongoDB (San Francisco, CA)
- **The Team** Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational functions that support the ... broader engineering organization. Among these are our multi- cloud -provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems.… more
- Palo Alto Networks (Santa Clara, CA)
- …precision. **Your Career** We are looking for a proactive and innovative Site Reliability Engineer ( SRE ) to join our growing team. In this role, you will be at the ... blameless postmortems to prevent future occurrences. + Leverage AI for SRE : Utilize AI-powered tools for advanced observability, anomaly detection, predictive… more
- Waystar (Atlanta, GA)
- …are seeking a seasoned and strategic Sr. Manager, Site Reliability Engineering ( SRE ) to lead a high-performing team responsible for the reliability, scalability, and ... a culture of ownership, innovation, and accountability. + Define and drive the SRE roadmap in alignment with business goals and engineering priorities. + Partner… more
- NVIDIA (Santa Clara, CA)
- …the world's most powerful GPU systems. Join our top team and apply your SRE and software engineering skills to craft robust, user-friendly platforms for seamless ML ... reproducibility and scalability across large-scale, distributed GPU clusters. + Apply SRE principles to diagnose, troubleshoot, and resolve complex system issues… more
- iCIMS (Salt Lake City, UT)
- **Job Overview** We are seeking a skilled Engineer, Site Reliability ( SRE ) to contribute to the reliability, scalability, and performance of our multi- cloud SaaS ... reliability. The successful candidate will work within a global SRE team to ensure optimal system performance and customer...goods globally, overnight, with a smile. As the Talent Cloud company, we empower these organizations to attract, engage,… more
- General Motors (Mountain View, CA)
- …Reliability Engineering ( SRE ) or production engineering environment. + Familiarity with cloud platforms such as AWS, GCP, or Azure. + Exposure to observability ... both opportunity and complexity. At General Motors, our Site Reliability Engineering ( SRE ) organization is built on software engineering principles. We design and… more