Senior Service Assurance Engineer
American Express (Phoenix, AZ)
Description
You Lead the Way. We’ve Got Your Back.
With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a career journey that’s unique and meaningful to you with benefits, programs, and flexibility that support you personally and professionally.
At American Express, you’ll be recognized for your contributions, leadership, and impact—every colleague has the opportunity to share in the company’s success. Together, we’ll win as a team, striving to uphold our company values and powerful backing promise to provide the world’s best customer experience every day. And we’ll do it with the utmost integrity, and in an environment where everyone is seen, heard and feels like they belong.
Join Team Amex and let's lead the way together.
As part of our diverse tech team, you can architect, code and ship software that makes us an essential part of our customers’ digital lives. Here, you can work alongside talented engineers in an open, supportive, inclusive environment where your voice is valued, and you make your own decisions on what tech to use to solve challenging problems. American Express offers a range of opportunities to work with the latest technologies and encourages you to back the broader engineering community through open source. And because we understand the importance of keeping your skills fresh and relevant, we give you dedicated time to invest in your professional development. Find your place in technology on #TeamAmex.
We’re looking for a Site Reliability / Application Support / Run Time Engineer (SRE/AS) responsible for the performance, availability, and reliability of Digital Payments applications. The candidate is responsible for providing consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. Site Reliability Engineering (SRE) is a continuous engineering discipline that effectively combines software development and systems engineering to build and run scalable, distributed, fault-tolerant systems. This role will ensure that American Express internal and external services have reliability and uptime appropriate to users' needs. We also ensure continuous improvement while keeping an ever-watchful, automated eye on capacity and performance.
This role will drive the SRE/AS mindset, which strives to use software engineering to build and run better production systems. You will write software to optimize day-to-day work through better automation, monitoring, alerting, testing, and deployment. You’ll be expected to work with several Technology partners to identify areas of opportunity within the availability platform and to build automated monitoring solutions for the modernization platform, driving efficiencies through technology and constant innovation. You will be responsible for implementing tracing, monitoring, and tooling solutions to maximize the performance and availability of our Digital Payments applications.
The Senior Service Assurance Engineer role is a hands-on, Senior Architect-level position supporting American Express Run Time Engineering and Application Support, part of the Production Management and Engineering organization. This role requires a deep understanding of digital payment ecosystems, operational excellence, and leadership in driving service reliability.
What you will be doing:
+ Research the latest technologies and concepts, conceptualize solutions, and develop proofs of concept that improve the resiliency and performance of the production infrastructure
+ Design and implement innovative solutions and frameworks that improve software engineering velocity, infrastructure resiliency and security, and data availability
+ Develop common framework components (to be leveraged by enterprise applications) and define standards for configuration, monitoring, reliability, and performance engineering
+ Work with Technology teams to resolve major incidents
+ Work closely with development, operations, and product teams to align reliability goals with business objectives.
+ Define and track SLAs, SLOs, and error budgets to measure and improve service performance.
+ Mentor and guide junior engineers, fostering a culture of reliability and operational excellence.
+ Ensure systems meet regulatory and security requirements for digital payment products.
+ Build and enhance automation frameworks for deployment, testing, and operational tasks to reduce manual effort, improve efficiency, and ensure the highest levels of availability
Qualifications:
+ BS or MS degree in computer science, computer engineering, or other technical discipline, or an equivalent 8 years of work experience in DevOps/SRE (web applications)
+ Development or support of Java/J2EE, React JS, and Node applications
+ Good understanding of automation implementations related to observability, reliability, and self-service
+ Hands-on experience with frameworks such as Spring Boot, Vert.x, and NodeJS
+ Experience in designing mission critical highly available enterprise applications
+ Hands-on experience with performance testing and Java application tuning
+ Experience managing relational and NoSQL databases such as DB2, Postgres, Mongo, Couchbase, Cassandra etc.
+ Strong knowledge of Linux internals and experience managing Linux systems in high traffic environments
+ Strong interpersonal communication skills and the ability to work well in a diverse team-focused environment
+ Experience with Splunk and/or ELK
+ Good understanding of cloud technologies - Kubernetes, OpenShift, Docker etc.
+ Knowledge of public cloud technologies (GCP, AWS, Azure, etc.) would be an advantage
+ Monitoring and analyzing PMI data
+ Hands-on experience with enterprise tool sets such as Grafana, Dynatrace, AppDynamics, BMC, Prometheus, etc.
+ Understanding of using Agile Practices in Operations teams
+ Experience handling DDoS/bot attacks and related security remediations
+ Working experience with Network load balancers, Global Traffic Managers (GTMs), Local Traffic Managers (LTMs)
+ Hands-on experience configuring Splunk, Grafana dashboards, ElastAlert, etc.
+ Working experience with network rule creation, load balancer configuration, and network packet analysis
+ On-call / 24x7 support required
+ Analytical skills and exposure to root cause identification using analyzer tools such as IBM Support Assistant, Splunk, etc.
+ Certificate Management automation - Message signing, SSL, etc.
Salary Range: $110,000.00 to $190,000.00 annually, plus bonus and benefits
The above represents the expected salary range for this job requisition. Ultimately, in determining your pay, we’ll consider your location, experience, and other job-related factors.
We back our colleagues and their loved ones with benefits and programs that support their holistic well-being. That means we prioritize their physical, financial, and mental health through each stage of life. Benefits include:
+ Competitive base salaries
+ Bonus incentives
+ 6% Company Match on retirement savings plan
+ Free financial coaching and financial well-being support
+ Comprehensive medical, dental, vision, life insurance, and disability benefits
+ Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need
+ 20 weeks paid parental leave for all parents, regardless of gender, offered for pregnancy, adoption or surrogacy
+ Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)
+ Free and confidential counseling support through our Healthy Minds program
+ Career development and training opportunities
For a full list of Team Amex benefits, visit our Colleague Benefits Site.
American Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability status, age, or any other status protected by law. American Express will consider for employment all qualified applicants, including those with arrest or conviction records, in accordance with the requirements of applicable state and local laws, including, but not limited to, the California Fair Chance Act, the Los Angeles County Fair Chance Ordinance for Employers, and the City of Los Angeles’ Fair Chance Initiative for Hiring Ordinance. For positions covered by federal and/or state banking regulations, American Express will comply with such regulations as it relates to the consideration of applicants with criminal convictions.
We back our colleagues with the support they need to thrive, professionally and personally. That's why we have Amex Flex, our enterprise working model that provides greater flexibility to colleagues while ensuring we preserve the important aspects of our unique in-person culture. Depending on role and business needs, colleagues will either work onsite, in a hybrid model (combination of in-office and virtual days) or fully virtually.
US Job Seekers - Click to view the “Know Your Rights” poster. If the link does not work, you may access the poster by copying and pasting the following URL in a new browser window: https://www.eeoc.gov/poster
Employment eligibility to work with American Express in the United States is required as the company will not pursue visa sponsorship for these positions.
**Job:** Technology
**Primary Location:** US-Arizona-Phoenix
**Other Locations:** US-Arizona-Phoenix
**Schedule:** Full-time
**Req ID:** 25007573