- TEKsystems (Columbus, OH)
- …* Incident Command & SRE Lead P1/P2 bridges for ML/LLM and batch pipelines. Drive root cause analysis, publish blameless post mortems, and ensure fixes are ... automated-not repeated. * DevSecOps Automation Patch CI/CD jobs, Helm charts, and Python utilities as part of incident follow up. Embed vulnerability scans, rollback logic, and change ticket integration. * Reliability Governance Define & track MTTR, change… more