Alerted.org | Alerted.org - Powering better job alerts

Senior Site Reliability Engineer

TEKsystems (Portsmouth, NH)

Apply Now

Description

About Us:

- Customer is one of the world’s leading enterprise software companies, modernizing, optimizing, and protecting the world’s most complex hybrid environments. With its engineering-centered culture, Customer is a global software leader building a comprehensive portfolio of industry-leading enterprise software enabling innovation, stability, scalability, and security for the largest global companies in the world.

- In the Agile Operations Division (AOD), we build software to support the world’s leading companies in making intelligent, data-driven decisions to achieve better business outcomes. Our industry leadership depends on a decades-long track record of delivering transformational solutions to teams who plan, build, test, and operate mission-critical software for the world’s largest and most complex businesses. To do this, we respond quickly and thoughtfully, innovate in the context of customer needs, and collaborate inclusively with customers and internal partners. Our business will nurture your intellect and give you opportunities to expand your skills even further.

About Customer Network Observability:

- AppNeta, a division of AOD, delivers comprehensive network performance monitoring solutions, providing deep visibility into end-user experience across complex, distributed networks. As a SaaS-based solution, IT and Network Ops teams can quickly pinpoint issues that affect network and business-critical cloud application performance, regardless of where they occur. We empower organizations to ensure optimal application performance and deliver exceptional digital experiences.

Our need:

- As a Senior Site Reliability Engineer, you will be responsible for the implementation and operation of cloud infrastructure for a SaaS based network monitoring solution.

In this role, you will:

- Lead the design, implementation, and operation of our SaaS platform, addressing concerns such as continuous integration, cloud infrastructure, solution deployment, and monitoring & alerting.

- Partner closely with our other engineering teams to evolve product/service architecture

- Migrate services into our freshly minted platform and collaborate with our dev teams to ensure that new services are designed with operability and observability in mind.

- Build out, deploy, and maintain our monitoring strategy and technology stack

- Automate all the things, freeing yourself and others from the tyranny of manual tasks.

- Contribute to the achievement of our 99.99% monthly availability by participating in our incident management process and quiet on-call rotation.

- Practice sustainable incident response and coordinate blameless postmortems.

- Mentor SRE team members, helping them reach their full potential.

- Assist in the definition, prioritization, and planning of work through backlog maintenance and collaboration on the product delivery roadmap.

Required Education and Experience

- Strong SRE/DevOps experience in building and operating cloud-based SaaS platforms

- Strong familiarity and experience with:

- ○ AWS and/or GCP

- ○ Infrastructure-as-code tooling (e.g. Terraform)

- ○ Containerization (Docker) and orchestration (Kubernetes, helm)

- ○ CI/CD pipelines, either self-hosted (e.g. Jenkins, TeamCity), or managed (e.g. GitHub Actions, GitLab)

- ○ Configuration management (Chef, Ansible, Puppet)

- ○ At least one programming language (Python preferred)

- ○ Monitoring solutions (e.g. Prometheus, Grafana, Cloudwatch, Stackdriver, ELK)

- ○ Linux systems, automation, package management

- Demonstrable aptitude to learn new technologies, and apply that knowledge to solve real problems

- Strong interpersonal communication skills (listening, speaking, and writing)

- Experience (or interest) in team lead/scrum master/project management responsibilities

- Experience operating large-scale, distributed systems on top of cloud infrastructure

Skills

Devops, Cloud, Python, Terraform, Kubernetes, Docker, Automation, Jenkins, gcp, Aws, Azure

Top Skills Details

Devops,Cloud,Python,Terraform,Kubernetes,Docker,Automation,Jenkins,gcp

Additional Skills & Qualifications

Reports to: Scott Allard – Sr Manager, SRE

Employment Type (Contact/Perm): Permanent

Onsite/Remote/Hybrid: Onsite 5x per week

Experience Level

Expert Level

Pay and Benefits

The pay range for this position is $140000.00 - $170000.00/yr.

Overview:Broadcom offers a competitive compensation and benefits package, including health insurance, retirement plans, and paid time off. We are committed to fostering a collaborative and inclusive work environment where employees can thrive and make a meaningful impact.

Benefits:- RSSP Matching- Full comprehensive benefits package (health and dental): 100% coverage, rather than just 80%.- Vacation: minimum 3 weeks to start – unlimited to discretion of manager.- Bonus - Broadcom does annual performance bonuses- Stock - RSUs are a part of the compensation package

Total Compensation: Broken down into 3 parts:1. Base Salary – expectations to be disclosed at first stage of screening process.2. Performance based bonus - % of salary. Depending on company performance you can even reach up to 120% of performance bonus(annually).Restricted Stock Units (RSU’s) – Our RSU’s at Broadcom begin vesting directly from your first quarter with our company and are paid out over 4 years. Vested out quarterly, RSU’s can either be kept in stock, moved over to your own private brokerage, or paid out quarterly in USD. Every year you get quarterly vested per RSU’s vested, which is higher than other companies, per share. Broadcom stock is very liquid, so those uneducated on stock options can sell every 3 months if preferred. Next year, Another 4-year allotment. Year 3: Another 4-year allotment. (For further explanation on this breakdown connect with Broadcom manager on this in final round interview if needed).

Workplace Type

This is a fully onsite position in Portsmouth,NH.

Application Deadline

This position is anticipated to close on May 19, 2025.

About TEKsystems and TEKsystems Global Services

We’re a leading provider of business and technology services. We accelerate business transformation for our customers. Our expertise in strategy, design, execution and operations unlocks business value through a range of solutions. We’re a team of 80,000 strong, working with over 6,000 customers, including 80% of the Fortune 500 across North America, Europe and Asia, who partner with us for our scale, full-stack capabilities and speed. We’re strategic thinkers, hands-on collaborators, helping customers capitalize on change and master the momentum of technology. We’re building tomorrow by delivering business outcomes and making positive impacts in our global communities. TEKsystems and TEKsystems Global Services are Allegis Group companies. Learn more at TEKsystems.com.

The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.

Apply Now

"Alerted.org

Advanced Search

Senior Site Reliability Engineer

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?