- JPMorgan Chase (Palo Alto, CA)
- …You've discovered the perfect environment to have a major impact. As a **Principal Site Reliability Engineer** at JPMorgan Chase within the **Enterprise ... capabilities, and skills** + Formal training or certification on site reliability engineering concepts and 10+ years ... and leads partnerships across job functions to develop efficient systems. + Engages team members and expresses complex ideas…
- General Motors (Mountain View, CA)
- …+ Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and observability ... our customers, including fleet management, energy optimization, transportation logistics, safety systems, and more. To fulfill our mission, we are actively expanding…
- Cornerstone OnDemand (Dublin, CA)
- We are seeking a highly skilled Site Reliability Engineer with 3 years of experience to join our dynamic team. The ideal candidate will have a strong ... on designing, implementing, and managing cloud-based solutions. As a Site Reliability Engineer, you will ... + Maintain operational runbook procedures for all production systems and document the knowledge base. + Administer incident…
- NVIDIA (Santa Clara, CA)
- …drive foundational improvements and automation to improve engineer's productivity. As a Site Reliability Engineer, you are responsible for the big ... picture of how our systems relate to each other, we use a breadth ... troubleshooting from bare metal to application level, ensuring system reliability and efficiency. + Develop, define and document standard…
- PennyMac (Westlake Village, CA)
- …quickly and accurately, is critical to the success of anyone in this role. The Engineer III, Site Reliability Operations will: + Monitoring - Oversee 24/7 ... journey. A Typical Day As a member of the Site Reliability Operations (SRO) team, you will ... timely and accurate resolution of service disruptions + Advanced Systems Administration - Perform and troubleshoot a wide range…
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency ... health + Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity + …
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency ... health. + Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity + …
- NVIDIA (Santa Clara, CA)
- …impact on the world. NVIDIA is looking to hire a deeply technical and creative Site Reliability Engineer to build, support and maintain the next generation ... challenges, automate processes, and iterate for efficiency + Tackle systemic reliability issues with multi-functional teams. + Monitor, optimize, and manage system…
- Palo Alto Networks (Santa Clara, CA)
- …runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer, you will be part of a team supporting the ... the Kubernetes cluster with autoscaling enabled + Experience in Production Engineering, DevOps, or Site Reliability + Expertise in the public cloud (GCP or AWS),…
- Palo Alto Networks (Santa Clara, CA)
- …and Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering + DevOps/SRE Expertise - 7+ years of experience ... including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides, you…