-
Senior Cloud Engineer
- System One (Herndon, VA)
-
Senior Cloud Engineer
100% remote
Compensation: $70/hr W2
NO C2C
US citizenship required per government contract
Security Clearance: Must be able to obtain Public Trust clearance
ALTA IT Service is seeking a highly skilled Sr. Cloud Engineer to support the daily operations and long-term reliability of our cloud-based infrastructure. This role is critical for ensuring uptime, performing proactive maintenance, troubleshooting issues and implementing fixes across our cloud environments. You will work closely with development, operations and security teams to ensure the scalability, performance and security of cloud applications. The ideal candidate will be responsible for maintaining cloud-based applications and infrastructure on AWS.
Responsibilities:
• Deploy applications across multiple environments (dev, staging, prod) and ensure consistency and stability
• Build reusable pipeline templates, jobs and stages for CI/CD consistency across teams
• Collaborate with developers to containerize and deploy applications using ECS and Lambda
• Configure GitLab Runners and manage environment-specific variables and secrets
• Define and deploy readiness and liveness probes for containers running in EKS/ECS
• Write custom scripts for CloudWatch custom metrics and alarms based on application specific probes
• Monitor deployments and system health using CloudWatch and other tools
• Implement rollback strategies and manage version control during deployments
• Troubleshoot and resolve deployment issues and improve pipeline performance and reliability
• Proficient with Python, Bash, YAML/JSON, Node.js, Lambda functions
• Perform daily health checks using AWS CLI or scheduled Lambda scripts to check health and log/report results
• Set up monitoring thresholds, dashboards, and metrics for application and infrastructure
• Perform root cause analysis and incident correlation using monitoring and performance analysis tools
• Maintain a central inventory of all licensed software deployed in AWS environments
• Maintain accurate documentation on infrastructure and procedures
• Patch assessment and maintenance of infrastructure software, to include third party software patches
• Develop a patch testing schedule and rollout plan to include rollback and recovery
• Create and manage change records. Participate in PI planning/ Agile ceremonies
• Keep cloud environments compliant with security standards and best practices
• Orchestrate failover and restoration of ECS/ EKS services, Lambda functions, databases and other infrastructure components
• Test and document regional failover playbooks and recovery runbooks
• Ensure compliance with RTO (Recovery Time Objective) and RPO (Recovery Point Objective) requirements
• Participate in on-call rotations to support 24/7 production systems and respond to incidents as they arise
Required Qualifications:
• BA/BS in IT, Computer Science or related field (or equivalent work experience may be accepted in lieu of the degree
• 8+ years of IT experience. 5+ years of experience in cloud support, infrastructure maintenance or IT operations.
• Experience with Infrastructure as Code (Terraform, CloudFormation)
• Strong proficiency in AWS Lambda (writing, deploying and, optimizing)
• Hands-on experience with CI/CD tools (GibHub, GitLab, Kubernettes, DevOps)
• Scripting skills for automation and maintenance tasks (Bash, Python)
• Cloud certifications (AWS DevOps Engineer, Solutions Architect Associate)
• Strong written and verbal communication skills for technical and non-technical stakeholders
• Excellent analytical and problem-solving skills
• Must be a US Citizen.
• Must be able to obtain and maintain a Public Trust clearance
Preferred Qualifications:
• Ability to diagnose performance issues in cloud environments
• Pre-check and post-check scripts for validating system health
• Familiarity with container orchestration (Docker, ECS, Kubernetes)
• Knowledge of ITIL practice or incident management frameworks
System One, and its subsidiaries including Joulé, ALTA IT Services, and Mountain Ltd., are leaders in delivering outsourced services and workforce solutions across North America. We help clients get work done more efficiently and economically, without compromising quality. System One not only serves as a valued partner for our clients, but we offer eligible employees health and welfare benefits coverage options including medical, dental, vision, spending accounts, life insurance, voluntary plans, as well as participation in a 401(k) plan.
System One is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, age, national origin, disability, family care or medical leave status, genetic information, veteran status, marital status, or any other characteristic protected by applicable federal, state, or local law.
#M2
#LI-VH1
#DI-VH1
Ref: #850-Rockville (ALTA IT)
System One, and its subsidiaries including Joulé, ALTA IT Services, CM Access, TPGS, and MOUNTAIN, LTD., are leaders in delivering workforce solutions and integrated services across North America. We help clients get work done more efficiently and economically, without compromising quality. System One not only serves as a valued partner for our clients, but we offer eligible full-time employees health and welfare benefits coverage options including medical, dental, vision, spending accounts, life insurance, voluntary plans, as well as participation in a 401(k) plan.
System One is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, age, national origin, disability, family care or medical leave status, genetic information, veteran status, marital status, or any other characteristic protected by applicable federal, state, or local law.
-