- PennyMac (Westlake Village, CA)
- …quickly and accurately, is critical to the success of anyone in this role. The Engineer III, Site Reliability Operations will: + Monitoring - Oversee 24/7 ... journey. A Typical Day As a member of the Site Reliability Operations (SRO) team, you will...- Tackle advanced technical issues that are escalated from Engineer I/II. Conduct deep dives into infrastructure and application… more
- Google (Sunnyvale, CA)
- …years of experience designing, analyzing, and troubleshooting large-scale distributed systems. Site Reliability Engineering (SRE) combines software and systems ... grow. **To learn more:** check out our books on Site Reliability Engineering or read a career...or read a career profile about why a Software Engineer chose to join SRE. In this role, with… more
- Palo Alto Networks (Santa Clara, CA)
- …runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer , you will be part of a team supporting the ... SRE and Dev teams in the on-call rotation + Lead root cause analysis of critical business and production...autoscaling enabled + Experience in Production Engineering, DevOps, or Site Reliability + Expertise in the public… more
- NVIDIA (Santa Clara, CA)
- …impact on the world. NVIDIA is looking to hire a deeply technical and creative Site Reliability Engineer to build, support and maintain the next generation ... challenges, automate processes, and iterate for efficiency + Tackle systemic reliability issues with multi-functional teams. + Monitor, optimize, and manage system… more
- NVIDIA (Santa Clara, CA)
- …experience. + Minimum of 8 years of industry experience in network site reliability engineering, network automation, network operations, or related areas. ... team is looking to add a seasoned Technical SRE lead to help actualize the SRE vision for our...for our network infrastructure. We are looking for an engineer who is passionate about the network and making… more
- Leidos (Vista, CA)
- **Description** This position will require up to 75% travel Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for ... and private/public cloud environments via infrastructure-as-code (IaC) and hands-on/client site activites. + Participate in the concept design of reusable… more
- Google (Sunnyvale, CA)
- …**Preferred qualifications:** + Master's degree in Computer Science or Engineering. Site Reliability Engineering (SRE) combines software and systems engineering ... Google Cloud's services-both our internally critical and our externally-visible systems-have reliability , uptime appropriate to customer's needs and a fast rate of… more
- NVIDIA (Santa Clara, CA)
- …cloud. Join us in this exciting endeavor! What You Will Be Doing: + Lead initiatives to transform IT Compute Core Team, architecture to build new service offerings ... NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity… more
- Google (Sunnyvale, CA)
- …and technologies. + Experience in building large-scale operations capabilities in Site Reliability Engineering. Google Cloud's software engineers develop the ... on and is growing every day. As a software engineer , you will work on a specific project critical...scaling from small to large deployments. As a Technical Lead , you will define the operations engineering strategy for… more
- Safran (Carson, CA)
- Engineer II, RMS ( Reliability , Maintainability, Safety) Company : Safran Cabin Job field : Architecture and systems engineering Location : Carson , California , ... and Failure Modes and Effects Summary per MIL-STD-1629A and D6-56674. -Prepare Engineering Reliability Parts Prediction Count Reports (ERPPC) RMS Engineer II is… more