- The Hartford (Hartford, CT)
- …Lead triage and resolution of high-severity incidents, minimizing business impact. + Improve incident response processes and reduce mean time to recovery (MTTR). ... Staff Reliability Engineer - IE07KE We're determined to make a...(Terraform, CloudFormation). Operational Excellence + Proven ability to lead incident response and root cause analysis. +…
- Insight Global (Frisco, TX)
- …address and remediate risks, optimize Varonis platform configurations, and improve alerting and incident response workflows. Once the rollout is complete and the ... Job Description We are seeking a mid-to-senior level Varonis Engineer (3-5 years of experience) to support DaaS security...* Build automation and tooling to support operations and incident response teams. * Troubleshoot support escalation…
- LinkedIn (Mountain View, CA)
- …different types of engineers to raise the bar for operational excellence and incident response + Define and build frameworks to improve monitoring, alerting, ... deeply to culture, hiring, and technical excellence + Lead incident response and post-incident reviews...availability and performance + Previous experience in a Distinguished Engineer or equivalent role at a high-growth or web-scale…
- Celonis (New York, NY)
- …operational knowledge and runbooks, embedding SRE best practices into onboarding, incident response, and platform architecture standards. **The qualifications ... analysis and restoration within defined SLOs, while continuously improving detection and response mechanisms. + Engineer solutions to enhance the availability,…
- NetApp (San Jose, CA)
- …Disseminating expertise through technical reviews, presentations, and documentation. + Incident Response & Remediation: Providing expert technical leadership ... **Distinguished Engineer, Data Governance and Privacy** Distinguished Engineers at NetApp are individual contributors who strive to be diverse in technology and…
- Insight Global (Newark, CA)
- …expertise in Azure architecture, disaster recovery, business continuity, and designing Major Incident Response Plans (MIRPs). Candidates should have over five ... company is looking to bring on a Cloud Resiliency Engineer to support their team. This role focuses on...hands-on experience in high availability design, resiliency patterns, and incident response coordination. Certifications such as Azure…
- Insight Global (Dallas, TX)
- …testing, and vulnerability scans to proactively identify and address security weaknesses. * Incident Response: Lead incident response efforts to ... We are currently seeking an experienced Senior Cyber Security Engineer to join our Information Security team. The ideal...and mitigate security incidents and breaches. Develop and maintain incident response plans and procedures. * Security…
- The Hartford (Charlotte, NC)
- …and Dynatrace Davis AI capabilities to enhance predictive analytics and automated incident response. Utilize AI-driven insights to proactively identify and ... Staff Reliability Engineer - IE07KE We're determined to make a...Partner with infrastructure teams to design and implement intelligent incident routing, enhanced monitoring/alerting capabilities and automated service restoration…
- CACI International (Chantilly, VA)
- …results and collaborate with IT teams to remediate identified security gaps. + Incident Response and Threat Mitigation: Develop and implement incident ... Information Systems Security Engineer Job Category: Information Technology Time Type: Full...to minimize attack surfaces and potential impact. + Security Incident Investigation: Lead investigations into security breaches, identifying the…
- LinkedIn (Mountain View, CA)
- …most for system health. + Lead improvements to monitoring, alerting, and incident response practices across engineering teams. + Partner with cross-functional ... growth in these two areas. As the Sr. Staff Engineer leading Engineering Excellence, you will play a critical...teams. + Lead initiatives to improve monitoring, alerting, and incident response, enabling proactive management of system…