• (USA) Senior Director, Site Reliability…

    Walmart (Sunnyvale, CA)
    …automation of disaster recovery certification, resiliency-as-a-service platforms, and large- scale incident management * Deep understanding of Hybrid cloud ... critical incidents. o Serve as the center of excellence for enterprise incident readiness, raising organizational resilience standards across Walmart Global Tech. *… more
    Walmart (11/13/25)
    - Related Jobs
  • Director, IT

    MongoDB (New York, NY)
    …enhance support processes to ensure efficiency and effectiveness **ITIL Best Practices and Incident Management :** + Collaborate with the ITSM Manager and GET ... Engineering to implement and enforce ITIL best practices related to incident management , change management , problem management , and continuous… more
    MongoDB (10/30/25)
    - Related Jobs
  • Director, Product Software Engineering | Platform…

    Wolters Kluwer (Cary, NC)
    …policy-as-code, CI/CD). . Establish site reliability practices-SLO/SLI, error budgets, incident management , post- incident reviews, and capacity/performance ... chain controls. . Observability platforms (metrics, logs, traces, SLOs, alerts) and incident management practices. . Coaching product teams; operating in a… more
    Wolters Kluwer (10/28/25)
    - Related Jobs
  • AVP, Technology Operations

    PennyMac (Westlake Village, CA)
    …of potential issues and complete visibility into system performance and health. + Incident Management - Oversee critical incident response processes, leading ... production and servicing of US mortgage loans and the management of investments related to the US mortgage market....to enhance operational efficiency and system reliability. + Team Management - Lead, mentor, and develop a team of… more
    PennyMac (10/23/25)
    - Related Jobs
  • Senior Cloud Architect/ Team Lead

    NTT America, Inc. (Plano, TX)
    …**Cloud Operations Management ** Provide oversight for the daily operations, incident management , change management , and problem resolution across ... and industry best practices. Collaborate with security teams for vulnerability management , cloud security compliance, and incident investigation. Be able… more
    NTT America, Inc. (10/17/25)
    - Related Jobs
  • Sr. Manager, Network Services- PVH Corp.

    PVH Corp. (Bridgewater, NJ)
    …the vendor + Acts as one of the final escalation points for the Incident Management process. + Ensures that his/her team provides efficient high-quality Network ... and able to handle different cultures + Deep understanding of IT Infrastructure management practices, including incident management , problem management ,… more
    PVH Corp. (10/10/25)
    - Related Jobs
  • DCEO Cluster Manager, AWS DC ops

    Amazon (Canton, MS)
    …to meet or exceed contracted performance SLA's. - Safety, security, and availability incident response, incident management , incident resolution, and ... an extraordinary individual with proven and tested leadership and management skills as a leader in our facilities operations...DCEO Cluster Manager is one of our most senior management roles in JAN. In this role, you will… more
    Amazon (10/03/25)
    - Related Jobs
  • Deskside Support Engineer

    Cognizant (Phoenix, AZ)
    …specialist to join our team. The ideal candidate will have expertise in IT Service Management Incident Management and End User Tools such as Nexthink. ... best practices. **Qualifications** + Possess strong expertise in IT Service Management and Incident Management . + Demonstrate proficiency in using end user… more
    Cognizant (12/12/25)
    - Related Jobs
  • Data Center Cluster Operations Leader, AMER MLZs

    Amazon (Fort Worth, TX)
    …maintenance task balancing internal skillsets with frugality. Safety, security, and availability incident response, incident management , and incident ... expanding Infrastructure Operations team. The Senior Manager is one of our most senior management roles in the data center environment. In this role, you will be… more
    Amazon (12/10/25)
    - Related Jobs
  • Senior Manager, Strategic Initiatives…

    AIG (Atlanta, GA)
    …+ Lead Enterprise Resiliency activities for GLCR, including business interruption and incident management response support, leading refreshes of business impact ... notifications, leads incident response for GLCR and supports incident management response processes, provide business and staff impact status. Leads annual… more
    AIG (12/10/25)
    - Related Jobs