- The Walt Disney Company (Glendale, CA)
- …and multi-agent workloads. + Build automation for safe rollout, monitoring, and incident response. **Observability, Reliability & Cost Management** + Implement ... platforms + Proven experience with observability stacks (Datadog, Prometheus, Grafana) and incident response automation. + Familiarity with AI/LLM APIs (OpenAI,…
- Robert Half Technology (Fresno, CA)
- …a seasoned, hands-on cybersecurity professional who enjoys advanced threat analysis, incident response, and security engineering within a regulated environment, ... in the financial services industry is seeking a Cybersecurity Engineer to join their growing Information Security team. This...you will: + Perform threat analysis, threat hunting, and incident response from detection through remediation +…
- Ford Motor Company (Dearborn, MI)
- …management, due diligence, and continuous monitoring of vendor security posture. * Incident Response Oversight: Partner with Cyber Defense leadership to align ... on incident response goals, requirements and data to...Professional certifications such as CISSP, CISM, CCSP, or equivalent cloud-specific certifications (e.g., Azure Security Engineer Associate,…
- Realtor (Austin, TX)
- …and optimize CI/CD spend (CircleCI, Argo CD optimization) Chaos Engineering & Incident Response + Design chaos engineering experiments to identify system ... incident reviews and drive systemic improvements + Mentor engineers on incident response, communication, and escalation; contribute to System Health Scorecard…
- Realtor (Austin, TX)
- …decisions and CI/CD spend optimization (CircleCI, Argo CD) **Chaos Engineering & Incident Response** + Execute chaos engineering experiments to identify system ... the Role** We are seeking a Senior Site Reliability Engineer to join our newly formed Operations Excellence organization,...post-incident reviews and implement improvements + Support incident response processes and contribute to System…
- Microsoft Corporation (Redmond, WA)
- …needs. + Embed Operational Excellence: Incorporate live site readiness, monitoring, and incident response into the development lifecycle. + Promote Engineering ... durability, and operational efficiency, including experience with live site operations, incident response, and performance optimization. + 2+ years of…
- NTT DATA North America (Austin, TX)
- …device status + Generate reports on service performance, file access metrics, incident response, and usage + Maintain thorough documentation for configuration, ... remote file access for end users, troubleshooting issues across cloud and on-prem infrastructure. This engineer must...processes, troubleshooting steps, and incident history + CTERA to Azure Files migration design…
- TP-Link North America, Inc. (Irvine, CA)
- …to security and compliance standards, including ISO27001, SOC2, and GDPR. + Lead incident response efforts to troubleshoot and resolve production issues quickly. ... looking for a passionate and experienced Senior Site Reliability Engineer to join our team and play a crucial...team and play a crucial role in ensuring our cloud platform's security, Reliability, scalability, and operational excellence. About…
- TP-Link North America, Inc. (Irvine, CA)
- …and compliance standards, including ISO27001, SOC2, and GDPR. + Participate in incident response efforts to troubleshoot and resolve production issues quickly. ... We're looking for a passionate and experienced Site Reliability Engineer to join our team and play a crucial...team and play a crucial role in ensuring our cloud platform's security, Reliability, scalability, and operational excellence. About…
- Waystar (Lehi, UT)
- …and manage error budgets. + Build automation for deployment, monitoring, and incident response. + **Observability & Monitoring** + Enhance system observability ... data products. This role is ideal for an experienced engineer who thrives in data-intensive environments and is passionate...and alerts to proactively detect and resolve issues. + **Incident Response & Postmortems** + Participate in…