- MongoDB (San Francisco, CA)
- …provide hybrid work accommodation. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) Lead with a strong networking background to join ... secure communication between systems. Their responsibilities encompass network architecture, service mesh, and edge load balancing, ensuring customer data remains… more
- Leidos (Vista, CA)
- **Description** This position will require up to 75% travel Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for talented ... and capacity planning + Create sustainable systems and services through service automation + Design, develop, troubleshoot, and debug mission critical infrastructure… more
- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... provisioning and management through automation. + Identify areas to improve service resiliency using industry-standard practices. + Detect performance issues and… more
- The Walt Disney Company (Sacramento, CA)
- …high availability, and clear observability + Maintain and improve the reliability of services and infrastructure + Troubleshoot and resolve performance and ... reliability issues across the stack, including cloud resources +...verbal communication skills + **Comfortable working with public cloud service providers (eg AWS, Google, Azure)** + Strong knowledge… more
- LinkedIn (Mountain View, CA)
- …community while making a real impact within our company. As a Sr. Staff Software Engineer , you will be a key technical leader and role model within the organization. ... Suggested Skills: . Distributed Systems . Technical Leadership . Infrastructure Reliability . Systems Infrastructure . Java/Golang/Rust/Python You will Benefit from… more
- Google (Sunnyvale, CA)
- …and mobile; the list goes on and is growing every day. As a software engineer , you will work on a specific project critical to Google's needs with opportunities to ... who use Google services around the world. We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running… more
- Google (Sunnyvale, CA)
- …qualifications:** + Master's degree in Computer Science or Engineering. Site Reliability Engineering (SRE) combines software and systems engineering to build and ... Google Cloud's services-both our internally critical and our externally-visible systems-have reliability , uptime appropriate to customer's needs and a fast rate of… more
- Rubrik (Palo Alto, CA)
- …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
- Palo Alto Networks (Santa Clara, CA)
- …automation, architecture, performance, observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, ... PKI concepts + Expertise in troubleshooting and resolving cloud infrastructure and service issues, identifying root cause and devising effective solutions for high… more
- Insight Global (Santa Clara, CA)
- …Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew that develops and maintains ... ancestry, genetic factors, age, disability, protected veteran status, military oruniformed service member status, or any other status or characteristic protected by… more