-
Mid-Level Site Reliability Engineer
- Insight Global (Irvine, CA)
-
Job Description
One of Insight Global’s customers is looking to onboard a Mid-Level Site Reliability Engineer with strong expertise in modern DevOps practices, cloud infrastructure, observability, and platform security. This role partners directly with product teams to support deployments, build reliable systems, and strengthen platform capabilities across Kubernetes, AWS, and CI/CD pipelines. This individual will be responsible for maintaining and optimizing AWS-based Kubernetes environments, ensuring reliability and security, building, maintaining, and enhancing CI/CD pipelines (ArgoCD, GitOps), and implementing observability using OpenTelemetry. This is a 6-month contract with the possibility of conversion and will be on-site 5 days a week in Irvine, CA.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to [email protected] learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Skills and Requirements
4–6 years of SRE/DevOps engineering experience
Kubernetes platform engineering (multi-cluster, deployment, troubleshooting)
AWS experience (GCP/Azure nice to have)
CI/CD with ArgoCD and GitOps workflows
Infrastructure as Code using Terraform & Helm
OpenTelemetry for metrics, logs, and tracing
Strong scripting with Python
Experience supporting monitoring, reliability, and performance across distributed systems Prometheus or Grafana
Multi-cloud familiarity
Experience with Kafka or event streaming platforms
-
Recent Jobs
-
Mid-Level Site Reliability Engineer
- Insight Global (Irvine, CA)
-
Electronic Integrated Systems Mechanic
- Headquarters, Air Force Reserve Command (Lackland AFB, TX)
-
Senior Project Engineer
- Amrize (Jacksonville, FL)
-
Warehouse Part Time Overnight
- Lowe's (Apex, NC)