• Senior Storage Production Engineer - DGX Cloud

    NVIDIA (Santa Clara, CA)
    …Doing: + Design, implement, and support large-scale storage clusters, ensuring scalability, high availability , and data integrity. + Develop and maintain storage ... that involves designing, building, and maintaining large-scale production systems with high efficiency and availability . It encompasses various areas, including… more
    NVIDIA (01/03/26)
    - Related Jobs
  • Director of Cloud & Infrastructure - (Herndon, VA)

    Serco (Washington, DC)
    …models for infrastructure investments to support corporate EBITDA and margin objectives. ** High Availability , Disaster Recovery & Monitoring** + Ensure high ... objectives. + Lead modernization of hybrid-cloud integration (AWS, Azure Gov, GCC- High ), data centers and network architecture redesign. + Represent the Cloud… more
    Serco (01/03/26)
    - Related Jobs
  • Cluster Operations Leader, ADC Data Center Ops

    Amazon (Manassas, VA)
    …for the Amazon Dedicated Cloud business. This organization operates rapidly-scaling, high - availability data centers supporting the US government. They execute ... Operations must be technically adept, operationally agile, and completely committed to availability . The Sr. Manager will be responsible for ensuring standards for… more
    Amazon (12/23/25)
    - Related Jobs
  • Site Reliability Developer 4

    Oracle (Annapolis, MD)
    …multiple services (Open to work in shifts & shows flexibility) Maintain Service High Availability Release Management Test and Deploy solutions and automate to ... Build and maintain deployment tools/procedures Zero downtime deployments and a high availability mindset Define and build innovative solution methodologies… more
    Oracle (12/04/25)
    - Related Jobs
  • Splunk Security Engineer (TS/SCI) (Ft. Belvoir,…

    SMX (Fort Belvoir, VA)
    …health of the Splunk system, identify issues, and implement solutions to maintain high availability and performance. + Optimize queries, alerts, and settings to ... respond to SLA breaches or data ingest issues. + Disaster Recovery and High Availability : + Design and implement disaster recovery and high availability more
    SMX (01/02/26)
    - Related Jobs
  • Sr Engineer, Storage/Data Protection - IT…

    Guthrie (Sayre, PA)
    …and administrative functions for The Guthrie Clinic (TGC). This role ensures high availability and performance for storage arrays and data protection ... and storage orchestration tools. + Experience with backup and recovery, high availability and disaster recovery functions. + Experience with ITSM functionalities… more
    Guthrie (11/07/25)
    - Related Jobs
  • Senior Critical Environment Technician (Training)

    Microsoft Corporation (Mount Pleasant, WI)
    …3+ years mission critical services work/applied learning experience (eg, high availability assembly/manufacturing/critical infrastructure environments such as ... equivalent AND 5+ years mission critical services experience (eg, high - availability assembly/manufacturing/critical infrastructure environments such as data… more
    Microsoft Corporation (01/06/26)
    - Related Jobs
  • Observability Engineer (US)

    TD Bank (Mount Laurel, NJ)
    …systems, making them scalable, reliable, and efficient while ensuring performance and high availability of products/services + Ensures availability , latency, ... understanding of distributed and cloud native systems engineering including scalability, high availability , and performance optimization + Strong hands on… more
    TD Bank (01/06/26)
    - Related Jobs
  • Director of AI SRE & DevOps, AI.x

    Charles Schwab (San Francisco, CA)
    …with internal stakeholders to champion frequent and low-risk changes to maintain high availability and quality. **What you have** **Required Qualifications** + ... CI/CD pipelines. + 3+ years of experience leading SRE teams in high - availability hybrid-cloud environments. + Strong people management skills, including hiring,… more
    Charles Schwab (12/06/25)
    - Related Jobs
  • Site Reliability Engineer

    Trellix (Frisco, TX)
    …responsible for monitoring, maintaining and troubleshooting operational issues of a high availability production environment. **Job Summary:** The Site ... responsible for monitoring, maintaining and troubleshooting operational issues of a high availability production environment. The SRE will also act as a bridge… more
    Trellix (11/27/25)
    - Related Jobs