- CommScope, Inc. (Sunnyvale, CA)
- …the World:** RUCKUS Networks is seeking an experienced **Site Reliability Engineering ( SRE ) Manager** to lead our NAM and APAC operations teams in transforming ... traditional operations into modern SRE practices. This high-impact leadership role will drive operational excellence, mentor engineering managers, and spearhead … more
- ServiceNow, Inc. (San Diego, CA)
- …8,100 customers, including 85% of the Fortune 500(R). Our intelligent cloud -based platform seamlessly connects people, systems, and processes to empower ... holding a green card, will be considered._** **_The Federal SRE Team has 3 shifts that provide 24x7 production...that provide 24x7 production support for our Government Community Cloud infrastructure._** _Below are some highlights._ + No on-call… more
- Walmart (Sunnyvale, CA)
- …tech organization. Included in this are data platforms, enterprise architecture, DevOps, cloud computing, and infrastructure. All of these products and services are ... customers and associates globally. You'll lead the transformation of traditional SRE practices into cutting-edge, self-healing platforms that serve as the… more
- Google (Sunnyvale, CA)
- …production operations for isolated systems and expertise in distributed cloud /on-premise. + Experience building or operating large-scale infrastructure platforms and ... distributed systems, with technical knowledge of public/private cloud , GPUs, virtualization, and containerization. + Ability to resolve deep systemic operational… more
- Palo Alto Networks (Santa Clara, CA)
- …large infrastructure and is one of the biggest GCP customers. As a Principal SRE , you'll be at the forefront of building and maintaining highly reliable, scalable, ... and secure cloud infrastructure within a FedRAMP compliant environment. You'll drive...a FedRAMP compliant environment. You'll drive operational excellence, champion SRE best practices, and work collaboratively to ensure our… more
- Walmart (Sunnyvale, CA)
- …from fraudulent activity across our global transaction platforms. Our DevOps and SRE engineers ensure these systems are reliable, scalable, and secure, enabling ... detection and prevention platforms. + Lead the design and rollout of SRE tooling, including monitoring, alerting, and incident response automation for high-risk… more
- MongoDB (San Francisco, CA)
- **The Team** Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational functions that support the ... broader engineering organization. Among these are our multi- cloud -provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems.… more
- Palo Alto Networks (Santa Clara, CA)
- …precision. **Your Career** We are looking for a proactive and innovative Site Reliability Engineer ( SRE ) to join our growing team. In this role, you will be at the ... blameless postmortems to prevent future occurrences. + Leverage AI for SRE : Utilize AI-powered tools for advanced observability, anomaly detection, predictive… more
- NVIDIA (Santa Clara, CA)
- …the world's most powerful GPU systems. Join our top team and apply your SRE and software engineering skills to craft robust, user-friendly platforms for seamless ML ... reproducibility and scalability across large-scale, distributed GPU clusters. + Apply SRE principles to diagnose, troubleshoot, and resolve complex system issues… more
- iCIMS (Sacramento, CA)
- **Job Overview** We are seeking a skilled Engineer, Site Reliability ( SRE ) to contribute to the reliability, scalability, and performance of our multi- cloud SaaS ... reliability. The successful candidate will work within a global SRE team to ensure optimal system performance and customer...goods globally, overnight, with a smile. As the Talent Cloud company, we empower these organizations to attract, engage,… more
Recent Jobs
-
Senior Member Technical Staff
- Oracle (Nashville, TN)
-
Tech Compressor Lead (Jacksonville)
- TECO Energy (Jacksonville, FL)