- Coinbase (Sacramento, CA)
- …fully supported. Coinbase is hiring! We are looking for an experienced Site Reliability Engineer ( SRE ) to join the IT Operations Corporate Engineering team to ... and maintain CI/CD pipelines for integrating changes and deploying to production in progressively tested environments * Deliver configurations and maintain state… more
- NVIDIA (Santa Clara, CA)
- …and trouble-shooting of compute hardware and networking equipment. As a software engineer , you will work with other software engineers, product architects, and ... code - from development to commit to test to production , including operational support. We expect you to be...communication protocols (mutual-TLS, IPsec, or similar). + Knowledge of SRE principles (observability, SLOs, logging, etc.) Ways to stand… more
- NVIDIA (Santa Clara, CA)
- …Engineers with previous experience building and running private and public clouds at production scale. As part of the DGX Cloud team, you'll have the opportunity ... at scales requiring fully automated management and under active customer consumption in production . + A track record demonstrating a mix of initiating your own… more
- Aeris Communications (San Jose, CA)
- …manner. Collaborate actively with other developers and other cross-functional teams like QA, SRE , and Operations. Assist in support of the existing code in ... production environments. Key Responsibilities + Investigate and evaluate advanced technologies, protocols, and architectures to identify scalable and efficient… more
- Broadcom (Palo Alto, CA)
- …the primary needs, technical challenges, and problems you will be responsible for? As a Senior Engineer of the VKS cluster management team, we expect you to: + ... The VKS cluster management team is looking for a Senior Kubernetes Engineer with deep expertise in...around backup health, coverage, and performance. + Partner with SRE , DevOps, and application teams to ensure backup strategies… more
- TP-Link North America, Inc. (Irvine, CA)
- …and more reliable connectivity. We're looking for a passionate and experienced Site Reliability Engineer to join our team and play a crucial role in ensuring our ... Development and DevOps teams. + Help analyze and resolve production risks caused by insufficient resources, such as node...and tools. + Participate in mentoring and training less senior members of the team. + Be part of… more
- The Walt Disney Company (Glendale, CA)
- …across all media platforms. **Job Summary:** We're looking for a Principal Platform Engineer , Infrastructure & Tooling for Services, Data, and GenAI to help shape ... scalable, and built to accelerate delivery without compromising quality. As a senior technical expert, you'll drive the architecture and evolution of our AWS-based… more
- PennyMac (Westlake Village, CA)
- …firm with a comprehensive mortgage platform and integrated business focused on the production and servicing of US mortgage loans and the management of investments ... team of Site Reliability Operations Engineers across all levels (1,2,3, & Senior ). Foster a culture of excellence, collaboration, and continuous learning while… more