  • Principal Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …+ Lead initiatives to transform IT Compute Core Team, architecture to build new service offerings across On-Prem and Cloud + You will design, scale, and deploy core ... NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity… more
    NVIDIA (08/21/25)
  • Linux Site Reliability Engineer

    Nutanix (Sacramento, CA)
    …or specific events. **Your Role** + Ensure the 24/7 availability and reliability of Nutanix's cloud services and infrastructure. + Respond promptly to alerts ... + Participate in on-call rotation to provide after-hours support and maintain service level agreements (SLAs). + Develop and enhance automation scripts using… more
    Nutanix (09/24/25)
  • Site Reliability Engineer II

    RELX INC (Sacramento, CA)
    …with modern multi cloud platforms and cutting-edge tools to enhance system reliability, visibility, and security across the entire development lifecycle. If you are ... + Monitoring & Observability: Create and optimize monitoring queries; establish service level baselines. + Incident Response: Support senior engineers during… more
    RELX INC (10/15/25)
  • Principal Site Reliability Engineer

    Palo Alto Networks (Santa Clara, CA)
    …automation, architecture, performance, observability, troubleshooting, security, and reliability. Our Infrastructure Platform stack includes Terraform, Kubernetes, ... + **Expert troubleshooting skills** to resolve cloud infrastructure and service issues, effectively identifying root cause and devising effective solutions.… more
    Palo Alto Networks (10/07/25)
  • Principal Site Reliability Engineer

    Palo Alto Networks (Santa Clara, CA)
    …automation, architecture, performance, observability, troubleshooting, security, and reliability. Our Infrastructure Platform stack includes Terraform, Kubernetes, ... PKI concepts + Expertise in troubleshooting and resolving cloud infrastructure and service issues, identifying root cause and devising effective solutions for high… more
    Palo Alto Networks (09/06/25)
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew that develops and maintains ... ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by… more
    Insight Global (09/09/25)
  • Staff Software Engineer

    LinkedIn (Mountain View, CA)
    …to you + Having interviews in an accessible location + Being accompanied by a service dog + Having a sign language interpreter present for the interview A request ... for an accommodation will be responded to within three business days. However, non-disability related requests, such as following up on an application, will not receive a response. LinkedIn will not discharge or in any other manner discriminate against… more
    LinkedIn (10/10/25)
  • Reliability Engineering Manager

    Teledyne (El Segundo, CA)
    …or systemic issues to senior leadership. **Supervisory Responsibilities** Directly manage the Reliability Department Staff: Reliability Engineer(s) and ... Event Upset) assessments. + Functional and design-level FMEA. + Support Entry-into-Service (EIS) reliability planning and performance tracking. + **Customer… more
    Teledyne (09/23/25)
  • Solutions Engineering Manager - Reliability

    Siemens (Sacramento, CA)
    …to strategically planned modernization - we ensure your systems' highest reliability and availability: 100% Railability. We are constantly developing new, ... usage and create a new quality of travel. Good service means we are there for our partners and...third-party rolling stock. Our mission is to ensure maximum reliability, availability, and performance of our customers' fleets through… more
    Siemens (10/03/25)
  • Senior Manager, Network Site Reliability

    NVIDIA (Santa Clara, CA)
    GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader ... and decrease Mean Time to Recovery (MTTR), improving overall service reliability and user satisfaction. + Work...Artificial Intelligence, and Autonomous Vehicles. If you're a creative engineer who enjoys autonomy and shares our passion for… more
    NVIDIA (08/08/25)