  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an ... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and…
    NVIDIA (12/19/25)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an ... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and…
    NVIDIA (11/05/25)
  • Sr Staff Site Reliability Engineer

    Palo Alto Networks (Santa Clara, CA)
    …delivering and deploying applications to production + Build observation (logging, metrics, alerting) systems to make sure system works well. + Design and ... Citizen or Green Card holder.** **Your Career** We are seeking development-heavy Site Reliability Engineers (SREs) who are passionate about bringing new ideas to all…
    Palo Alto Networks (12/12/25)
  • Undergrad Site Reliability Engineer

    Oracle (Sacramento, CA)
    …will be joining the OCSC (Oracle Cloud Service Centre) as an SRD (site reliability developer). Your job role will be helping Oracle ensure the availability of cloud ... experiencing both development and operations. As a Cloud Service Centre Site Reliability Developer Intern you will be involved with: **Operations** + Administer…
    Oracle (11/25/25)
  • Principal Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... like eBPF and XDP for Observability & DDoS mitigation + Collect and review system data for capacity and planning purposes, analyze capacity data and develop plans…
    NVIDIA (11/20/25)
  • Principal Network Reliability

    Oracle (Sacramento, CA)
    **Job Description** The mission of our Network Reliability Engineering team is to provide exceptional network reliability and automation services that enable our ... network monitoring and telemetry solutions. + Experience with Ticket systems like Jira, and Version control systems like Git. + Knowledge of Scrum & Agile…
    Oracle (12/01/25)
  • Sr. Site Reliability Engineer

    NBC Universal (Universal City, CA)
    systems, responding to alerts, and resolving issues promptly. The engineer also oversees and improves complex telecommunications systems that support ... is expected to be completed during 2025. The Unified Communication Engineer at NBC Universal holds extensive responsibility across various Unified Communications…
    NBC Universal (12/24/25)
  • Senior Principal Network Reliability

    Oracle (Sacramento, CA)
    …a significant technical and business impact designing and building innovative new systems to power our customer's business critical applications. This role offers ... smart people who are solving complex problems in distributed systems, networking, multi-tenant Infrastructure-as-a-Service (IaaS), and Software Defined Networking…
    Oracle (12/11/25)
  • Sr Site Reliability Engineer (Prisma…

    Palo Alto Networks (Santa Clara, CA)
    …champion SRE best practices, and work collaboratively to ensure our systems are robust and performant. This includes automation, architecture, performance, ... observability, troubleshooting, security, and reliability. Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab CI/CD, GitOps, Prometheus,…
    Palo Alto Networks (12/12/25)
  • Junior Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew that develops and maintains ... and Driverless Cars to cater to their infrastructure & systems needs. As an SRE, you'll also be working ... Science, Information Technology, or related field, or equivalent experience. - System admin and Windows admin experience in an on…
    Insight Global (12/07/25)