- ServiceNow, Inc. (San Diego, CA)
- …improve the reliability and performance of the infrastructure through improved system design. + Drive a culture of intolerance to manual activity which results ... It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today -… more
- ServiceNow, Inc. (San Diego, CA)
- It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... of the Fortune 500(R). Our intelligent cloud-based platform seamlessly connects people, systems , and processes to empower organizations to find smarter, faster, and… more
- NVIDIA (Santa Clara, CA)
- …actualize the SRE vision for our network infrastructure. We are looking for an engineer who is passionate about the network and making its operation seamless with a ... be focused on user satisfaction and brilliance in Network Operations. This SRE engineer will focus on tackling significant projects and is committed to fostering a… more
- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- Palo Alto Networks (Santa Clara, CA)
- …including the design, implementation, and continuous enhancement of our comprehensive observability systems . To meet the opportunities that such a role provides, you ... to develop innovative solutions that provide clear and actionable insights into our systems ' performance and health. **Your Impact** As a Senior Staff SRE with the… more
- NVIDIA (Santa Clara, CA)
- …Will Be Doing: + Architect, lead, and scale globally distributed production systems supporting AI/ML, HPC, and critical engineering platforms across hybrid and ... that reduce manual tasks, promote resilience, and uphold standard methodologies for system health, change safety, and release velocity. + Define and evolve… more
- The Walt Disney Company (Sacramento, CA)
- …knowledge in system management languages (eg Terraform, Ansible) + Operating systems and systems management (eg Amazon Linux, Windows) + **Multiple scripting ... of the team that provides cutting edge film making systems in the public cloud, focused on automation and...availability, and clear observability + Maintain and improve the reliability of services and infrastructure + Troubleshoot and resolve… more
- Celonis (Redwood City, CA)
- …engineering and Site Reliability Engineering (SRE) principles to drive system reliability , scalability, and operational excellence across the organization. ... join us. **The Team** As a member of our Reliability Engineering team, you will play a critical role...providers (AWS, Azure, or GCP) and modern cloud monitoring system observability frameworks (eg, Datadog). + Working knowledge developing… more
- NVIDIA (Santa Clara, CA)
- …SOL quality and efficiency. The DFP team is looking for a Speed and Reliability Lead. You will be leading and crafting testability features related to Speed, Timing ... and Reliability from ground up as you help turbocharge NVIDIA's...with the best minds in NVIDIA across various teams ( System Architecture, PDE, Application Engineering, Product Manager, Sales, Operations)… more