- Palo Alto Networks (Santa Clara, CA)
- …insights into our systems ' performance and health. **Your Impact** As a Senior SRE with the Cortex Cloud Security Posture Management team, you will: + Cloud ... including the design, implementation, and continuous enhancement of our comprehensive observability systems . To meet the opportunities that such a role provides, you… more
- ServiceNow, Inc. (San Diego, CA)
- …improve the reliability and performance of the infrastructure through improved system design. + Drive a culture of intolerance to manual activity which results ... It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today -… more
- NVIDIA (Santa Clara, CA)
- …foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are responsible for the big picture of how ... our systems relate to each other, we use a breadth...comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency. + Develop, define… more
- The Walt Disney Company (Sacramento, CA)
- …knowledge in system management languages (eg Terraform, Ansible) + Operating systems and systems management (eg Amazon Linux, Windows) + **Multiple scripting ... of the team that provides cutting edge film making systems in the public cloud, focused on automation and...availability, and clear observability + Maintain and improve the reliability of services and infrastructure + Troubleshoot and resolve… more
- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- Coinbase (Sacramento, CA)
- …and fully supported. Coinbase is hiring! We are looking for an experienced Site Reliability Engineer (SRE) to join the IT Operations Corporate Engineering team ... platform - and with it, the future global financial system . To achieve our mission, we're seeking a very...* Define metrics and bolster monitoring/observability across corporate IAM systems * Participate in regular on-call rotation to ensure… more
- Rubrik (Palo Alto, CA)
- … and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance ... enable teams at Rubrik to develop secure software and protect data and systems with appropriate security controls. Information Security also develops systems to… more
- LiveRamp (San Francisco, CA)
- …issues with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and ... Dynamodb** + **Optimize the performance and cost of the systems and rightsize Kubernetes containers.** + **Work in close...code, and automate routine tasks** + **Experience with securing systems in a public cloud environment** + **Understands how… more
- Lumen (Sacramento, CA)
- …digitally connect the world and shape the future. **The Role** We are looking for a Senior Site Reliability Engineer (SRE)/ Platform Engineer / DevOps ... (AWS EKS) with a focus on networking, scalability, security, and reliability . Troubleshoot complex, cross- system issues involving Kubernetes, databases,… more