Remote Observability Architect
- Insight Global (Herndon, VA)
Job Description
A client of Insight Global is looking for an Enterprise Observability Architect to lead the design and strategy of its observability ecosystem. This role owns architecture for logging, metrics, traces, events, and SLO/SLI frameworks, and drives multi-tenant Splunk Cloud patterns and large-scale migrations from Splunk Enterprise. You will champion OpenTelemetry-first instrumentation, define cost-optimized data tiers and retention policies, and enable Splunk ES, ITSI, and APM integrations. Responsibilities include HA/DR design, observability-as-code via CI/CD, and enforcing security and compliance standards (FISMA, NIST, HIPAA). You will partner with product teams to set SLIs/SLOs, reduce MTTR through actionable alerting, and lead continuous improvement. This role also involves mentoring engineers, shaping technical roadmaps, and delivering knowledge transfer across operations teams.
Compensation: $80/hr to $88/hr. Exact compensation may vary based on several factors, including skills, experience, and education. Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to [email protected]. To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Skills and Requirements
• Bachelor's degree and 8+ years of experience
• 8+ years of experience in large-scale observability/SRE/production engineering, including 3+ years as lead/architect
• Extensive hands-on experience in Splunk Cloud/Core, index design, data models/KSOs, SPL, content packs, ES/ITSI, and at least one high-volume Splunk
• Experience designing observability for AWS/Azure/GCP and Kubernetes
• Experience following federal compliance standards such as FISMA/NIST, RMF, HIPAA
• Splunk certifications (e.g., Splunk Certified Architect, Splunk Cloud Certified Admin, ES/ITSI certs).
• Experience with complementary stacks (Grafana/Prometheus/Loki/Tempo, Elastic, Datadog, New Relic) and event correlation/AIOps.
• Knowledge of ITSM/ITIL processes, service catalogs/CMDB, and service ownership models.
• Background supporting federal agencies or healthcare-scale environments.