-
Site Reliability Engineer, Infrastructure
- Cisco (San Francisco, CA)
-
Who We Are
The name ThousandEyes was born from two big ideas: the power to see what’s not ordinarily possible, and the ability to collect intelligence from vantage points as diverse and global as the Internet. As organizations depend on cloud services, the Internet has become their defacto network connecting cloud applications to users. Our Internet and cloud intelligence platform is like a ‘Google maps of the Internet’, providing the only collectively powered view of digital experiences end-to-end. We enable our customers made up of the world’s largest and fastest-growing brands, to identify problems before they impact revenue, brand reputation, or employee productivity.
In August 2020, Cisco Systems completed the acquisition of ThousandEyes, which now forms the ThousandEyes Business Unit within Cisco’s Network Services Business Group, and is a foundational component of Cisco’s growing Observability business.
About The Role
Are you ready to put your hands in production and manage clusters that receive hundreds of millions of messages per hour?
The Site Reliability Engineering team is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. The role will help the company’s core infrastructure services, maintaining a constantly growing infrastructure capable of handling a very high volume of incoming data per day.
We believe in operations/infrastructure/everything as code, which makes our distributed team efficient, functional, and very effective.
You Can Expect To Work On
You will be an integral part of designing and operating large-scale highly available distributed systems in the cloud. You will collaborate with our application development teams to ensure the reliability and performance of our infrastructure.
As part of the team, you will work multi-functionally to ensure the ThousandEyes platform infrastructure, and services are designed and optimized for availability, latency, and performance.
Minimum Qualifications
* Proficient in writing high-quality code in Python, Go, or equivalent languages.
* Expert using Unix/Linux systems, the kernel, system libraries, file systems, and client-server protocols.
* Strong Infrastructure as Code skills, ideally with Terraform, Puppet, and Kubernetes.
* Experience working with AWS
* Drive and build automation wherever possible, enabling our infrastructure and platforms to scale thoughtfully.
Preferred Qualifications
* Ability to design and implement scalable and well-tested solutions.
* Strong communication and documentation skills.
* Strong sense of ownership, drive, and passion for attention to detail.
Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.
Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.
-
Recent Searches
- Lead Software Development Engineer (Idaho)
- Process Development Senior Associate (California)
- Practice Office Assistant START (Pennsylvania)
Recent Jobs
-
Site Reliability Engineer, Infrastructure
- Cisco (San Francisco, CA)
-
Director of Global Regulatory Affairs
- Pall (Miami, FL)
-
Senior Platform Engineer (Current CompTIA Security + or ability to obtain)
- Raytheon (Richardson, TX)
-
Sr. Director Operational QA
- Aldevron (Miami, FL)