- NVIDIA (Santa Clara, CA)
- …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site Reliability Engineer focused on HPC storage and play a ... environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources. + Document the general procedures and practices,… more
- Amazon (Cupertino, CA)
- …machine learning capabilities for our customers. We're seeking an experienced C/C++ engineer to join our embedded software team, where you'll develop bare metal ... operations experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as… more
- MongoDB (San Francisco, CA)
- …in the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to join the Fabric ... and from the public internet. Their responsibilities encompass network architecture, service mesh, and edge load balancing, ensuring customer data remains safe… more
- Oracle (Sacramento, CA)
- …are solving complex problems in distributed systems, networking, multi-tenant Infrastructure-as-a-Service (IaaS), and Software Defined Networking (SDN) operating at ... for decomposing high-level architectures into detailed designs. Work with supporting service teams to ensure solutions are properly supported by monitoring and… more
- NVIDIA (Santa Clara, CA)
- …+ Lead initiatives to transform IT Compute Core Team, architecture to build new service offerings across On-Prem and Cloud + You will design, scale, and deploy core ... NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity… more
- LinkedIn (Mountain View, CA)
- …in Sunnyvale, CA or San Francisco, CA. **Responsibilities** + Serve as a senior technical leader driving the long-term reliability and observability strategy ... engineering teams to enable the right business decisions around improving quality and reliability of our services and products + Act as a force multiplier by… more
- Walmart (Sunnyvale, CA)
- …management, or related area. SRE certification (for example, IBM Cloud Site Reliability Engineer). We value candidates with a background in creating ... **What you'll do** **Location: Sunnyvale / Bentonville** **Department: Reliability Engineering / Business Reliability Engineering (BRE)** **Reports To:… more
- Insight Global (Santa Clara, CA)
- …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew ... ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by… more
- Amazon (San Diego, CA)
- …highly reliable, power efficient, performant, low-cost satellite bus and payload. As a Sr. PCBA Manufacturing Engineer, you will work across design engineering, ... prioritize issues for whole program. You use data to drive manufacturability, reliability, and quality objectives in design and development teams. Key job… more
- Oracle (Pleasanton, CA)
- …standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, ... Knowledge of Oracle databases would be a plus. **Responsibilities** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a… more