- pony.ai (Fremont, CA)
- …Pony.ai went public on NASDAQ in November 2024. Responsibilities As a (Senior) Kubernetes Engineer, you will: + Design, operate, and optimize Kubernetes clusters ... security policies, and operational guidelines. + Contribute to observability and SRE practices to ensure reliability at scale (SLOs, incident reviews, metrics-driven…
- MongoDB (Palo Alto, CA)
- …for long-term platform health + Orchestrate World-Class Observability: Collaborate with SRE and Solutions Architects to build advanced instrumentation strategies and ... to critical business outcomes + Unrivaled Influence: You earn the trust of senior engineers through technical depth and respect for complexity, not just title. You…
- General Motors (Mountain View, CA)
- …bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General Motors, you'll join a dedicated team focused on enhancing ... This role is a hands-on Individual Contributor (IC) position. As an SRE IC, you will focus on enhancing the reliability, efficiency, and performance of our…
- Oracle (Sacramento, CA)
- …eventing + Establish CI/CD, automated testing, metrics/logging/tracing, and SRE-aligned operations + Champion security, compliance, and privacy-by-default practices…
- Walmart (Sunnyvale, CA)
- …+ Build reusable tools, libraries, and dashboards that can be used across DevOps/SRE teams **What you'll bring:** + Bachelor's degree in Computer Science, Engineering ... or related discipline + 5+ years of hands-on experience related to SRE and Operations; development experience with JavaScript, Java, RESTful services, Git, Maven, Jenkins,…
- NVIDIA (Santa Clara, CA)
- …well as managing vendor relationships. You will partner with engineering, SRE, product, and third-party infrastructure providers to achieve operational excellence. ... operational excellence best practices across all infrastructure providers, partnering with SRE, infra, product, and security teams + Define and operationalize…
- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking a Senior Technical Program Manager to lead the Infrastructure and Product Security and Compliance program for DGX Cloud. In this role, you will ... highest standards of trust, resilience, and governance. As a Senior TPM focused on Cloud Security, you will own ... and processes, establishing security KPIs, dashboards, and "run safe" SRE practices. + Partner with the CISO organization to…
- Charles Schwab (San Francisco, CA)
- …explore next-generation GenAI efforts that will redefine how we serve our clients. As a Senior AI Site Reliability Engineer on AI.x, you will play a key role in ... the most exciting areas of technology today. As a Senior AI Site Reliability Engineer, you will design, implement, ... lead by example in solving complex reliability challenges, advancing SRE standards, and driving rapid iteration from concept to…
- TP-Link North America, Inc. (Irvine, CA)
- …smarter, and more reliable connectivity. We're looking for a passionate and experienced Senior Site Reliability Engineer to join our team and play a crucial role ... and tools + Help to mentor and train less senior members of the team + Ability to be ... Java, Python, Bash, or PowerShell. + Hands-on experience in SRE, DevOps, cloud operations, and cloud security best practices.…
- Walmart (Sunnyvale, CA)
- …**What you'll do** We are seeking a talented and passionate **"Senior Engineering Manager"** for Membership Acquisition, Data & Reporting who will lead multiple ... + **Champion engineering and operational excellence** Implement and track DORA/OE/SRE metrics, including deployment frequency, lead time, change fail rate,…