- NVIDIA (Santa Clara, CA)
- …systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and blameless postmortems + Be part of an on call ... NVIDIA is looking for an outstanding, passionate, and talented Senior AI Infrastructure Engineer to join our DGX Cloud group. This engineering role will design,… more
- Oracle (Sacramento, CA)
- …logging, and auditability. + **Reliability Engineering:** Skills in monitoring, alerting, incident response , and root cause analysis for highly-available, ... **Job Description** We are seeking a skilled and motivated Engineer to join Oracle Base Database Service team. The... to join Oracle Base Database Service team. The Engineer will be responsible for designing, deploying, maintaining, and… more
- Coinbase (Sacramento, CA)
- …operational procedures, and troubleshooting steps across system lifecycle * Facilitate incident response , conduct root cause analysis, and blameless ... and fully supported. Coinbase is hiring! We are looking for an experienced system engineer (SE) to join the IT Operations Corporate Engineering team to build and… more
- NVIDIA (Santa Clara, CA)
- …multi-cloud and hybrid (on-prem + cloud) environments, implementing monitoring, alerting, and incident response protocols. + Participate in on-call rotation to ... generative AI to autonomous vehicles. We are now looking for a ML Platform Engineer to help accelerate the next era of machine learning innovation. In this role,… more
- NVIDIA (Santa Clara, CA)
- …systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and blameless postmortems + Be part of an on call ... rotation to support production systems What we need to see: + BS degree in Computer Science or a related technical field involving coding (eg, physics or mathematics), or equivalent experience + 8+ years of experience with Infrastructure automation,… more
- NVIDIA (Santa Clara, CA)
- …encryption, access controls, and auditing mechanisms for storage systems. + Practice sustainable incident response and blameless root cause analysis. Be part of ... an on-call rotation to support storage and production systems. What We Need To See: + BS degree or equivalent experience in Computer Science, Storage Systems, or a related technical field with 8+ years of practical experience. + Experience with distributed and… more
- CVS Health (Sacramento, CA)
- …across multiple regions and environments (cloud, on-premises, colocation). + Develop incident response and recovery strategies. **Required Qualifications:** + 5+ ... years of experience in developing and deploying security technologies. + 5+ years of experience with modern Software Development Lifecycles and CI/CD practices, including pipeline automation and security integration. + 3+ years of experience with remediation… more
- Oracle (Sacramento, CA)
- …scalability, performance, and operational efficiency + Participate in on‑call and incident response , drive root cause analysis, and preventive actions ... + Collaborate with cross‑functional teams to deliver end‑to‑end infrastructure solutions + Mentor and help onboard new team members Disclaimer: **Certain US customer or client-facing roles may be required to comply with applicable requirements, such as… more
- Oracle (Sacramento, CA)
- …experience on a major public cloud, including observability, orchestration, and incident response . BS/MS in Computer Science, Electrical/Computer Engineering, or ... equivalent practical experience; proven technical leadership and mentoring. Preferred: Familiarity with high-performance IO paths; understanding of cross-region networking and latency trade-offs. Strong foundation in consensus and transactions. Expertise with… more
- NVIDIA (Santa Clara, CA)
- …systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and blameless postmortems + Be part of an on call ... rotation to support production systems What we need to see: + BS degree in Computer Science or a related technical field involving coding (eg, physics or mathematics), or equivalent experience. + 10+ years of experience. + Experience with Infrastructure… more