- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- The Walt Disney Company (Sacramento, CA)
- …knowledge in system management languages (eg Terraform, Ansible) + Operating systems and systems management (eg Amazon Linux, Windows) + **Multiple scripting ... of the team that provides cutting edge film making systems in the public cloud, focused on automation and...availability, and clear observability + Maintain and improve the reliability of services and infrastructure + Troubleshoot and resolve… more
- ServiceNow, Inc. (San Diego, CA)
- It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... of the Fortune 500(R). Our intelligent cloud-based platform seamlessly connects people, systems , and processes to empower organizations to find smarter, faster, and… more
- Rubrik (Palo Alto, CA)
- … and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance ... enable teams at Rubrik to develop secure software and protect data and systems with appropriate security controls. Information Security also develops systems to… more
- LiveRamp (San Francisco, CA)
- …issues with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and ... Dynamodb** + **Optimize the performance and cost of the systems and rightsize Kubernetes containers.** + **Work in close...code, and automate routine tasks** + **Experience with securing systems in a public cloud environment** + **Understands how… more
- Insight Global (Santa Clara, CA)
- …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew ... and Driverless Cars to cater to their infrastructure & systems needs. As an SRE, youll also be working...Science, Information Technology, or related field, or equivalent experience. - System admin and Windows admin experience in an on… more
- MongoDB (San Francisco, CA)
- …to build next-generation, AI-powered applications. We are looking for an experienced Staff Engineer for our SRE, InfraSec team, to guide the security of our ... on security work, with ideally 2+ years in a senior or staff engineering role Security Mindset: + A...low-level fundamentals, and how they work together in complex systems Communication and Leadership Skills: + Strong ability to… more
- NVIDIA (Santa Clara, CA)
- GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader ... be doing: + Cultivate a top-performing team of Network Site Reliability Engineers through encouraging a culture of collaboration, accountability, and technical… more
- Rubrik (Palo Alto, CA)
- …we want to talk to you! **About The Role:** Sr . Site Reliability Engineers at Rubrik are systems /software engineers who ensure that Rubrik's infrastructure ... our customers + Design, implement and maintain relational database systems for performance and reliability + Manage...years of experience as a Development, DevOps or Site Reliability Engineer Willing to provide 24/7 coverage… more