- Oracle (Sacramento, CA)
- **Job Description** The mission of our Network Reliability Engineering team is to provide exceptional network reliability and automation services that enable our ... support for events. Serve as the escalation point for event remediation. Lead post-event root cause analysis. Join major event/incident calls, use technical and… more
- Oracle (Sacramento, CA)
- **Job Description** The mission of our Network Reliability Engineering team is to provide exceptional network reliability and automation services that enable our ... support for events. Serve as the escalation point for event remediation. Lead post-event root cause analysis. + Join major event/incident calls, use technical… more
- IBM (San Jose, CA)
- …growth and innovation thrive. . **Your role and responsibilities** As a Site Reliability Engineer, you will work in an agile, collaborative environment to build, ... and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to… more
- NVIDIA (Santa Clara, CA)
- …cloud. Join us in this exciting endeavor! What You Will Be Doing: + Lead initiatives to transform IT Compute Core Team, architecture to build new service offerings ... NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity… more
- General Motors (Sunnyvale, CA)
- …Groovy + On-call and fire-fighting experience + Experience with modern site reliability practice including but not limited to post mortem, SLO/SLI, Tracing, ... Zero Crashes, Zero Emissions and Zero Congestion and we embrace the responsibility to lead the change that will make our world better, safer and more equitable for… more
- Palo Alto Networks (Santa Clara, CA)
- …automation, architecture, performance, observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, ... infrastructure across multi-cloud environments for our federal customers. + ** Lead cross-functional initiatives** to ensure applications are production-ready, scalable,… more
- LinkedIn (Mountain View, CA)
- …the application layer. We do this with a focus on performance, security, and reliability . As a Sr. Staff Software Engineer, you will fill the mission-critical role ... operating large-scale systems. Responsibilities: + You will function as the technical lead for multiple key initiatives, identify problems & opportunities and … more
- Walmart (Sunnyvale, CA)
- …that empower internal teams through intuitive, self-service automation. + Lead platform reliability practices-ensuring high availability, robust observability, ... available in this role". **What You'll Do** + **Define and lead architecture** for core infrastructure automation components, including orchestrators, workflow… more
- NVIDIA (Santa Clara, CA)
- …and board designers, software/firmware engineers, HW/SW applications engineering, process/ reliability specialists, DFx engineers, ATE engineers, product managers, ... to complex silicon and system level problems and be on the frontline to lead show-stopper bugs, in order to enable product shipment. + Collaborating with the best… more
- Oracle (Sacramento, CA)
- …large-scale, distributed cloud infrastructure systems that meet high standards for reliability , performance, and security. + Lead technical architecture ... cloud-scale infrastructure systems that power tomorrow's enterprise solutions. + Lead technical problem-solving and hands-on implementation across the full stack,… more