- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- Palo Alto Networks (Santa Clara, CA)
- …delivering and deploying applications to production + Build observability (logging, metrics, alerting) systems to make sure the system works well. + Design and ... Citizen or Green Card holder.** **Your Career** We are seeking development-heavy Site Reliability Engineers (SREs) who are passionate about bringing new ideas to all… more
- Oracle (Sacramento, CA)
- …will be joining the OCSC (Oracle Cloud Service Centre) as an SRD (site reliability developer). Your job role will be helping Oracle ensure the availability of cloud ... experiencing both development and operations. As a Cloud Service Centre Site Reliability Developer Intern you will be involved with: **Operations** + Administer… more
- NVIDIA (Santa Clara, CA)
- …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... like eBPF and XDP for Observability & DDoS mitigation + Collect and review system data for capacity and planning purposes, analyze capacity data and develop plans… more
- Oracle (Sacramento, CA)
- **Job Description** The mission of our Network Reliability Engineering team is to provide exceptional network reliability and automation services that enable our ... network monitoring and telemetry solutions. + Experience with ticket systems like Jira and version control systems like Git. + Knowledge of Scrum & Agile… more
- NBC Universal (Universal City, CA)
- …systems, responding to alerts, and resolving issues promptly. The engineer also oversees and improves complex telecommunications systems that support ... is expected to be completed during 2025. The Unified Communication Engineer at NBC Universal holds extensive responsibility across various Unified Communications… more
- Oracle (Sacramento, CA)
- …a significant technical and business impact designing and building innovative new systems to power our customer's business critical applications. This role offers ... smart people who are solving complex problems in distributed systems, networking, multi-tenant Infrastructure-as-a-Service (IaaS), and Software Defined Networking… more
- Palo Alto Networks (Santa Clara, CA)
- …champion SRE best practices, and work collaboratively to ensure our systems are robust and performant. This includes automation, architecture, performance, ... observability, troubleshooting, security, and reliability. Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab CI/CD, GitOps, Prometheus,… more
- Insight Global (Santa Clara, CA)
- …Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew that develops and maintains ... and Driverless Cars to cater to their infrastructure & systems needs. As an SRE, you'll also be working... Science, Information Technology, or related field, or equivalent experience. - System admin and Windows admin experience in an on… more