- Electric Power Research Institute (Palo Alto, CA)
- **Job Title:** Fuel Reliability Principal Team Lead **Location:** Charlotte, NC, Palo Alto, CA **Job Summary and Description:** The position is for an individual ... demonstrated with experience and activities related to nuclear fuel operation, reliability and performance + Understanding of current fuel operation technical issues…
- NVIDIA (Santa Clara, CA)
- …aspect of the network infrastructure, ensuring its high availability and reliability. + Partnering with architecture and deployment teams to guarantee that ... + Minimum of 8 years of industry experience in network site reliability engineering, network automation, network operations, or related areas. Experience on both…
- MongoDB (San Francisco, CA)
- …multi-cloud-provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems. The Fabric team manages the infrastructure that enables ... secure communication between systems and from the public internet. Their responsibilities encompass...region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to…
- MongoDB (San Francisco, CA)
- …multi-cloud-provider Kubernetes infrastructure, deployment machinery, and observability and alerting systems. The Fabric team manages the infrastructure that enables ... secure communication between systems. Their responsibilities encompass network architecture, service mesh, and...accommodation. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) Lead with a strong networking background…
- Google (Sunnyvale, CA)
- …+ 7 years of experience building and developing infrastructure, distributed systems, or networks, or experience with compute technologies, storage, or hardware ... + Experience in building large-scale operations capabilities in Site Reliability Engineering. Google Cloud's software engineers develop the next-generation…
- ServiceNow, Inc. (Santa Clara, CA)
- …of the Fortune 500(R). Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and ... **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute to the design,.../Splunk/GitLab CI); + Strong working experience operating distributed systems built on Linux and J2EE; + Experience with…
- LiveRamp (San Francisco, CA)
- …issues with Engineering teams** + **Set up and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and ... DynamoDB** + **Optimize the performance and cost of the systems and rightsize Kubernetes containers.** + **Work in close...code, and automate routine tasks** + **Experience with securing systems in a public cloud environment** + **Understands how…
- Palo Alto Networks (Santa Clara, CA)
- …champion SRE best practices, and work collaboratively to ensure our systems are robust and performant. This includes automation, architecture, performance, ... observability, troubleshooting, security, and reliability. Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab...and Dev teams to support critical business and production systems + Lead root cause analysis of critical business…
- Tarana Wireless (Milpitas, CA)
- …bridging the digital divide in ways previously thought impossible. As a Senior Site Reliability Engineer, you will help us manage software that runs on the cloud and ... environment, to support millions of connected devices + Monitoring of all live systems + Troubleshoot and triage production active issues What You'll Need: + BS…
- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and guaranteeing ... techniques and Infrastructure as Code (IaC). + Deep understanding of Linux operating systems and TCP/IP fundamentals. + Expertise with at least one major cloud…