• Staff Site Reliability Engineer - Data & ML

    Datavant (Richmond, VA)
    …for healthcare. **What We're Looking For** We're looking for a **Senior Site Reliability Engineer** to join our Data & ML Platform team. You'll be at the forefront ... environments. Drive initiatives around failover, autoscaling, chaos testing, and capacity planning. + **Advance Observability**: Build and maintain platform-wide…
    Datavant (04/18/25)
  • Sr. Hardware Reliability Engineer

    Amazon (Herndon, VA)
    …modeling and data analytics. During sustaining stage, candidate will be responsible for monitoring product performance in the field and will be responsible to ... Description As an Infrastructure Reliability Engineer you will be proactively driving the reliability risk identification, assessment and mitigation for datacenter…
    Amazon (04/25/25)
  • Sr Engineer, Application Development…

    Cardinal Health (Richmond, VA)
    …to production outages. + Analyzes production system operations using tools such as monitoring, capacity analysis and outage root cause analysis to identify and ... Health_** We have a career opening for a Sr Engineer of Rebates & Incentives for our Pharma IT...change that ensures continuous improvement in system stability and performance. + Demonstrates knowledge of software development, life cycle,…
    Cardinal Health (04/11/25)
  • Critical Environment Electrical Engineer

    Microsoft Corporation (Boydton, VA)
    …culture every day and we need you as a **Critical Environment Electrical Engineer.** Microsoft's **Cloud Operations & Innovation (CO+I)** is the engine that powers ... our cloud services. As a CO+I Electrical Engineer, you will perform a key role in delivering...infrastructures throughout datacenter campus ranging from a single large capacity facility, to several smaller ones. + Work with…
    Microsoft Corporation (04/08/25)
  • Commissioning Engineer, AMER-East ACx

    Amazon (Herndon, VA)
    Description As a Data Center Commissioning Engineer (CxE), you will be part of highly creative, efficient team tasked with tackling fascinating and challenging ... / VFD) -Chilled Water Systems -Building Management systems (BMS) -Electrical Power Monitoring Systems (EPMS) -Testing and balancing -Pumps and Hydronic systems AWS…
    Amazon (02/06/25)
  • Enterprise Application Engineer, Supply…

    GE Aerospace (Glen Allen, VA)
    …outages severely impacting consumers * Establish performance baseline, capacity thresholds, correlate events, and define monitoring/alerting criteria * ... to resource performance and leverage data to predict overall system capacity while automating load placement to avoid outages * Identify thresholds for all…
    GE Aerospace (04/30/25)
  • Data Center Controls Engineer

    Amazon (Herndon, VA)
    …the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience ... and maintaining the building management system (BMS) and electrical power monitoring system (EPMS). Using Amazon leadership principles, you will develop new…
    Amazon (05/01/25)
  • Facility Operations Center Engineer, ADC…

    Amazon (Arlington, VA)
    …the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience ... Facility Operations Center. The Facility Operations Center is responsible for 24X7 monitoring of the data center's physical infrastructure. This team will ensure…
    Amazon (04/29/25)
  • Network Engineer (Levels 2, 3)

    CACI International (Springfield, VA)
    …+ Evaluate and report on new/emerging network/communication technologies to enhance the capacity, performance, and reliability of the network. + Evaluate and ... Network Engineer (Levels 2, 3) Job Category: Information Technology...**More About the Role:** + Coordination of system maintenance, monitoring, and installation of multiple WAN/LAN environments encompassing multiple…
    CACI International (02/11/25)
  • Senior Site Reliability Engineer -FedRAMP…

    Cisco (VA)
    …disaster recovery, backup/restore, RTO, RPO + Chaos engineering + Application uptime and performance + Capacity management & planning + SLIs, SLOs, error ... to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user interfaces, in…
    Cisco (03/14/25)