• Database Engineer III, Site

    PennyMac (Westlake Village, CA)
    …through the complete mortgage journey. A Typical Day As a member of the Site Reliability Operations (SRO) team, you will help provide 24/7 monitoring and ... support of Pennymac's database infrastructure and related systems . This role focuses specifically on database operations, performance optimization, and ensuring the … more
    PennyMac (09/11/25)
    - Related Jobs
  • Senior Site Reliability

    Rubrik (Sacramento, CA)
    … and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance ... enable teams at Rubrik to develop secure software and protect data and systems with appropriate security controls. Information Security also develops systems to… more
    Rubrik (08/20/25)
    - Related Jobs
  • Principal Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... analyze capacity data and develop plans for appropriate level enterprise-wide systems , and coordinate with management personnel in implementing changes. + Develop… more
    NVIDIA (08/21/25)
    - Related Jobs
  • Linux Site Reliability

    Nutanix (Sacramento, CA)
    …team plays a crucial role in ensuring the smooth operation of critical systems , leveraging cutting-edge technologies and automation to achieve our goals. You will ... Our work setup is hybrid, requiring you to be on- site three days a week while giving you the...events. **Your Role** + Ensure the 24/7 availability and reliability of Nutanix's cloud services and infrastructure. + Respond… more
    Nutanix (09/24/25)
    - Related Jobs
  • Reliability Engineer

    C&W Services (Hesperia, CA)
    …and Safety, to optimize building systems , maintenance, and project implementation. The Reliability Engineer will also work closely with the Controls System ... **Job Title** Reliability Engineer **Job Description Summary** **Job...teams' goals. **Key Responsibilities:** + **Inventory Management** : Manage site Inventory Management System (IMS) Team and ensure effective… more
    C&W Services (10/04/25)
    - Related Jobs
  • Principal Site Reliability

    Palo Alto Networks (Santa Clara, CA)
    …champion SRE best practices, and work collaboratively to ensure our systems are robust and performant. This includes automation, architecture, performance, ... observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab...and Dev teams to support critical business and production systems + Lead root cause analysis of critical business… more
    Palo Alto Networks (09/06/25)
    - Related Jobs
  • Senior Site Reliability

    LiveRamp (San Francisco, CA)
    …issues with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and ... Dynamodb** + **Optimize the performance and cost of the systems and rightsize Kubernetes containers.** + **Work in close...code, and automate routine tasks** + **Experience with securing systems in a public cloud environment** + **Understands how… more
    LiveRamp (08/07/25)
    - Related Jobs
  • Launch Reliability Engineer

    SpaceX (CA)
    Launch Reliability Engineer Vandenberg, CA Apply SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally ... ultimate goal of enabling human life on Mars. LAUNCH RELIABILITY ENGINEER The Launch Reliability ...is the advocate for mission assurance at the launch site by promoting efficient and reliable operational processes. You… more
    SpaceX (07/07/25)
    - Related Jobs
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew that develops and maintains ... and Driverless Cars to cater to their infrastructure & systems needs. As an SRE, you'll also be working...new products and manage our infrastructure, associated processes and systems . Keen attention to detail, problem-solving abilities, and a… more
    Insight Global (09/09/25)
    - Related Jobs
  • Intern 2026: Site Reliability

    IBM (San Jose, CA)
    …agile techniques **Preferred technical and professional experience** * Experience with Dell Systems platform management * Systems Engineer certification in ... The IBM Research ETE organization will provide the buildout of systems in the POK building 008 datacenter. Responsibilities would include installing… more
    IBM (09/19/25)
    - Related Jobs