- Walmart (Sunnyvale, CA)
- …SLAs/SLOs, automating reliability frameworks (chaos engineering, self-healing), and leading incident management at scale. + **Innovation Driver:** History ... canary analyses, and self-healing mechanisms. + Lead post-mortem investigations for major incidents, driving root-cause analysis and systemic remediation to prevent… more
- Meta (Fremont, CA)
- …Skills:** Network Engineer, Operations and Support (Labs) Responsibilities: 1. Incident Response: Drive work investigating complex technical and process issues ... and continuity disciplines for infrastructure spanning thousands of locations during major incidents/site events on edge, caching, and network infrastructure. This… more
- PagerDuty (San Francisco, CA)
- …The PagerDuty Operations Cloud combines AIOps, Automation, Customer Service Operations and Incident Management with a powerful generative AI assistant to create ... PagerDuty, Inc. (NYSE:PD) is a global leader in digital operations management . Half of the Fortune 500 and nearly 70% of the Fortune 100 trust PagerDuty as essential… more
- V2X (El Segundo, CA)
- …of BIM efforts. This mid-level role ensures ServiceNow applications - including Incident , Change, Asset, and Configuration Management - are properly aligned ... data integrity, and user support while helping to optimize IT service management (ITSM) and IT operational management (ITOM) capabilities. Position will… more
- PagerDuty (San Francisco, CA)
- …The PagerDuty Operations Cloud combines AIOps, Automation, Customer Service Operations and Incident Management with a powerful generative AI assistant to create ... PagerDuty, Inc. (NYSE:PD) is a global leader in digital operations management . Half of the Fortune 500 and nearly 70% of the Fortune 100 trust PagerDuty as essential… more
- Amgen (Thousand Oaks, CA)
- …substance manufacturing. + Serve as an experienced advisor to senior level management and ensure the automation solutions & strategy development aligns with ... Manufacturing Process requirements. + Strategic and tactical management and leadership in providing 24x7 day-to-day operational support and capital project support.… more
- PagerDuty (San Francisco, CA)
- …The PagerDuty Operations Cloud combines AIOps, Automation, Customer Service Operations and Incident Management with a powerful generative AI assistant to create ... PagerDuty, Inc. (NYSE:PD) is a global leader in digital operations management . Half of the Fortune 500 and nearly 70% of the Fortune 100 trust PagerDuty as essential… more
- Wolters Kluwer (Sacramento, CA)
- …+ Own the full lifecycle of platform services, including observability, incident response, capacity planning, and performance optimization. + Create and maintain ... each week. **Technical Skills:** **Cloud Infrastructure & Platforms** + Proficiency in major cloud providers (AWS, Azure, GCP) with hands-on experience in core… more
- Brantner and Associates, Inc (El Cajon, CA)
- …expectations are clearly communicated. + Responsible for hiring, performance management , employee development, leadership and motivation for engineers and ... or related + Minimum of 7 years applicable including supervision/ management experience. + Six Sigma Green Belt training/Black belt...or provide any personal information, and to report the incident to your local authorities. Location: EL CAJON, CA,… more
- CVS Health (Sacramento, CA)
- …reliable, cost-effective platforms tailored to the needs of CVS Health. **Project Management :** Engage executives, department heads, and IT teams to plan, execute, ... This also involves communicating project progress to stakeholders. **Stakeholder Management ** - Cultivate and maintain relationships with application owners ensuring… more