- 
        Site Reliability Engineer (SRE)
- Cognizant (Arizona City, AZ)
- 
             About the role As a Site Reliability Engineer (SRE), you will make an impact by designing and implementing advanced observability solutions for edge computing environments. You will be a valued member of our Infrastructure & Operations team, collaborating with engineering and platform teams to ensure high availability, reliability, and performance across distributed systems. In this role, you will: + Design and implement observability frameworks for edge environments, including monitoring, logging, tracing, and metrics collection. + Define and maintain SLIs, SLOs, and business KPIs to improve system reliability across edge and centralized infrastructure. + Build and optimize dashboards, visualizations, and alerting systems for real-time insights and rapid incident response. + Implement distributed tracing and log aggregation systems to troubleshoot complex issues in edge computing. + Collaborate with engineering teams to embed observability best practices into applications and infrastructure. + Drive proactive issue detection and resolution, reducing MTTD and MTTR across distributed systems. + Lead incident postmortems and implement observability-driven improvements to prevent recurrence. + Develop automation scripts and tools to enhance observability pipelines, addressing edge-specific challenges like bandwidth and connectivity. What you need to have to be considered + 3–5 years of experience in service reliability/operations for large-scale, high-performance applications in hybrid environments (on-prem and cloud). + Strong scripting and automation skills for building dashboards and managing application performance. + Proficiency in programming languages such as Go, Python, Java, or Rust. + Hands-on experience with databases (Oracle, SQL Server, Redis, Clickhouse, Postgres, MongoDB, or time-series DBs). + 2+ years of experience transitioning platforms to cloud and containerization (GCP, AWS, Rancher, or similar). + Experience maintaining containerized applications in GKE/RKE/AKE environments. + Expertise in implementing cloud observability using OpenTelemetry (OTEL) for monitoring and distributed tracing. + Knowledge of networking protocols (TCP/IP, HTTP, DNS) and troubleshooting in high-pressure scenarios. These will help you stand out + Experience managing application availability for 24x7 high-availability platforms. + Familiarity with monitoring tools like Splunk, AppDynamics, Grafana/Prometheus, and Dynatrace. + Hands-on experience with CI/CD tools and Rally, Confluence. + Knowledge of in-memory caching solutions (Redis preferred). + Strong debugging skills across integrated technical platforms and API gateways. + Exposure to GCS, Cloud SQL, Spanner, Firestore, and enterprise-level infrastructure operations. + Experience with HashiCorp Vault, Vertex AI, Gen AI, and BigQuery. Work model: On-site This is an onsite position requiring presence at a Cognizant or client location in Arizona City, AZ and/or Scottsdale, AZ. We strive to provide flexibility wherever possible and support a healthy work-life balance through our wellbeing programs. The working arrangements for this role are accurate as of the date of posting. This may change based on the project you’re engaged in, as well as business and client requirements. Rest assured; we will always be clear about role expectations. Applicants may be required to attend interviews in person or by video conference. In addition, candidates may be required to present their current state or government issued ID during each interview. Salary and Other Compensation: The annual salary for this position is between $60,000 – $93,500 depending on experience and other qualifications of the successful candidate. This position is also eligible for Cognizant’s discretionary annual incentive program, based on performance and subject to the terms of Cognizant’s applicable plans. **Benefits:** Cognizant offers the following benefits for this position, subject to applicable eligibility requirements: • Medical/Dental/Vision/Life Insurance • Paid holidays plus Paid Time Off • 401(k) plan and contributions • Long-term/Short-term Disability • Paid Parental Leave • Employee Stock Purchase Plan Disclaimer: The salary, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law. Cognizant is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. 
 
 
-