-
Lead Site Reliability Engineer (Remote -CST)
- Cognizant (Carson City, NV)
About Cognizant
Cognizant (Nasdaq: CTSH) engineers modern businesses. We help our clients modernize technology, reimagine processes and transform experiences so they can stay ahead in our fast-changing world. Together, we're improving everyday life. See how at www.cognizant.com or @cognizant.
About Cognizant’s Digital Engineering Practice:
At Cognizant Digital Engineering, a small multi-functional team composed of a Product Manager, an Architect, Full-Stack Developers, UI/UX designers and Big Data analysts builds higher quality software faster than siloed individuals working independently. Small, forward-thinking engineering teams generate collective empathy and camaraderie, thus increasing their ability to anticipate unforeseen development scope changes and maintain high-quality work. Across our US Studio system or within client development sites, our Digital Engineering teams ideate and develop innovative cloud-based solutions following a Lean-Agile process with DevOps culture. Working in Cognizant Digital Engineering provides DevOps engineers consistent opportunities to push digital boundaries while growing their exposure to transformational technologies.
The Role:
Cognizant is looking for an experienced and innovative Lead Site Reliability Engineer to serve our diverse base of global clients. As a member of our team, you will build innovative, cloud-based software that powers modern business. An ideal candidate is someone who enjoys working in a diverse, collaborative, geographically distributed team. Similarly, the ideal candidate is an expert engineer who values the “team”, drives continuous improvement and is unafraid to challenge the legacy status quo with creative cloud-based solutions.
**Candidate must be legally authorized to work in the United States without the need for employer sponsorship, now or at any time in the future**
Roles and Responsibilities
+ Design and implement robust architecture solutions that align with business goals and technical requirements.
+ Oversee the deployment and management of cloud infrastructure using AWS, Terraform, and Ansible.
+ Provide expertise in Unix and Linux systems to ensure optimal performance and security.
+ Utilize Docker and Kubernetes for containerization and orchestration of applications.
+ Implement and manage OpenShift for scalable and efficient application deployment.
+ Monitor application performance using APM tools and ensure high availability.
+ Manage databases and SQL queries to support application functionality and data integrity.
+ Use Grafana and other monitoring tools to visualize and track system performance metrics.
+ Implement SRE practices for monitoring and observability, ensuring system reliability and uptime.
+ Develop and maintain scripts in Python and Java to automate tasks and improve efficiency.
+ Utilize Splunk for log management and analysis to identify and resolve issues.
+ Apply chaos engineering principles to test system resilience and improve fault tolerance.
+ Collaborate with cross-functional teams to ensure seamless integration and deployment of solutions.
+ Provide technical guidance and mentorship to junior team members.
+ Stay updated with the latest industry trends and technologies to continuously improve architecture solutions.
+ Perform SRE activities independently.
+ Excel at implementing core SRE principles and practices.
+ Work closely with the team on estimation and resource planning, and deliver solutions that meet business needs.
+ Understand management requirements and plan strategically from an SRE and resiliency perspective.
+ Specialize in building and managing automation.
+ Triage production incidents and perform root cause analysis (RCA).
+ Provide observability and monitoring with APM tools, creating dashboards, alerts, and automation for incidents.
+ Demonstrate leadership qualities such as cross-team collaboration and effective communication.
Required Qualifications
+ Possess strong technical skills in cloud basics, Unix, Linux, Docker, Kubernetes, OpenShift, APM, databases and SQL, Grafana, monitoring tools, AWS, Terraform, Ansible, Python, Java, and Splunk.
+ Demonstrate experience in SRE, monitoring and observability, and chaos/resiliency engineering.
+ Have a good understanding of the Cards & Payments domain.
+ Exhibit excellent problem-solving and analytical skills.
+ Show strong communication and collaboration abilities.
+ Display a proactive approach to learning and adapting to new technologies.
Applications will be accepted until 06/20/2025.
Salary and Other Compensation:
The annual salary for this position is between $81,337 and $141,500, depending on experience and other qualifications of the successful candidate.
This position is also eligible for Cognizant’s discretionary annual incentive program, based on performance and subject to the terms of Cognizant’s applicable plans.
Benefits: Cognizant offers the following benefits for this position, subject to applicable
Disclaimer: The salary, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.
Why Choose Cognizant?
It takes a lot to succeed in today’s fast-paced market, and Cognizant Technology Solutions has become a leader in the industry. We love big ideas and even bigger dreams! We stand out because we put human experiences at the core. Our associates enjoy robust benefits and training opportunities from our industry-recognized, award-winning Academy team. You will have access to hundreds of technical training courses to keep your skillsets fresh and have opportunities to acquire certifications on the newest technologies.
Everything we do at Cognizant we do with passion: for our clients (Fortune 100 companies), our communities, and our organization. It’s the defining attribute that we look for in our people.
If you love ambiguity, are excited by change, and excel through autonomy, we’d love to hear from you!
Cognizant is an equal opportunity employer that embraces diversity, champions equity and values inclusion. We are dedicated to nurturing a community where everyone feels heard, accepted and welcome. Your application and candidacy will not be considered based on race, color, sex, religion, creed, sexual orientation, gender identity, national origin, disability, genetic information, pregnancy, veteran status or any other protected characteristic as outlined by federal, state or local laws.