- Coinbase (Sacramento, CA)
- …Q3 2023. *What you'll be doing (ie. job duties):* * Improve observability, reliability and availability by defining and measuring key metrics * Build automation and ... and automate incident response * Proactively find and analyze reliability problems across our business units and stack, then...and hold accountable the engineering team to improve the reliability of our systems and make reliability … more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Observability & Telemetry collection… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Coinbase (Sacramento, CA)
- …and fully supported. Coinbase is hiring! We are looking for an experienced Site Reliability Engineer (SRE) to join the IT Operations Corporate Engineering team ... to build and scale our identity and access management tooling. A successful candidate will have demonstrated previous success in similar role(s) in rapidly growing, security-first environments. The right person is passionate about infrastructure as code, open… more
- Rubrik (Sacramento, CA)
- …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
- LiveRamp (San Francisco, CA)
- …issues with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and ... Terraform scripts in support of the mission in close collaboration with DevOps team** + **Maintain and enhance Engineering Operational Documentation for supported products.** + **Provide expertise to build and maintain products operational documentation and… more
- General Motors (Mountain View, CA)
- …_Work Arrangement:_ _This role is categorized as hybrid. This means the successful candidate is expected to report to the office three times per week or_ _other_ ... _frequency dictated by the business._ _ _ **What** **You'll** **Do** + Leads and generates technical solutions including specifying of requirements, functional decomposition, analysis, development and testing for current, new and major programs + Lead… more
- NVIDIA (Santa Clara, CA)
- …that are groundbreaking in AI and computing. What you'll be doing: As a Reliability Methodology Engineer at NVIDIA, you will be responsible for ensuring our ... design, product, and test engineering teams to apply DFT methodologies to improve reliability screening specific to HTOL (Component level Hight Temp Op. Life Test).… more
- Rubrik (Palo Alto, CA)
- …+ Minimum 1-3 years of experience as a Development, DevOps or Site Reliability Engineer Willing to provide 24/7 coverage + Strong Documentation skills ... win, we want to talk to you! **About The Role:** Senior Software Engineers - Reliability at Rubrik are systems/software engineers who ensure that Rubrik's… more
- Palo Alto Networks (Santa Clara, CA)
- …actionable insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: ... influence the operability of the product and ensure the reliability and availability of our services **Your Experience** +...DevOps/SRE Expertise: 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong… more