Alerted.org

  • Cloud Operations Engineer

    Sage (Atlanta, GA)




    Job Description:

    Position Summary

    We are seeking a proactive and collaborative Cloud Operations Engineer. In this role, you will be responsible for shaping how we deliver infrastructure at scale, ensuring stability and quality across environments through automation, continuous improvement, and strong alignment with engineering teams.

    Our infrastructure is treated as an internal product, developed and maintained through a disciplined software development lifecycle (SDLC). Just like our applications, infrastructure is planned, built, tested, deployed, and supported with high standards for performance, reusability, and reliability.

    You will work closely with developers, site reliability engineers, and platform tooling teams to ensure deployments are efficient, safe, and transparent, supporting multiple products across AWS and Azure with varying release cadences.

    Key Responsibilities

    Partner with engineering, QA, and platform teams to ensure infrastructure releases are well-planned, fully coordinated, and meet operational standards.

    Design and maintain CI/CD pipelines that accelerate delivery while maintaining stability and visibility across all environments.

    Automate infrastructure provisioning and configuration using Terraform, Ansible, and Python, with a focus on reusability and consistency.

    Own and troubleshoot build systems for PHP applications, including Make- and Maven-based workflows.

    Monitor the health of release processes, ensuring issues are detected early and releases can be rolled back or remediated with minimal impact.

    Define and track key DevOps metrics (e.g., deployment frequency, change failure rate, MTTR, lead time for changes) to assess process effectiveness and guide improvements.

    Produce regular reports and dashboards that provide stakeholders with visibility into infrastructure activity, deployment success rates, and areas for optimization.

    Lead efforts to continuously improve our build and deployment workflows, reducing friction and increasing velocity for infrastructure delivery.

    Provide documentation, change tracking, and clear release visibility to all stakeholders, supporting transparency and auditability.

    Help establish and evolve operational readiness standards, contributing to production stability and post-deployment success.

    Support structured weekly maintenance windows and coordinate emergency deployments when required.

    Champion best practices around change control, secure automation, and resilient cloud operations.

    Required Skills and Experience

    5+ years in Site Reliability Engineering, Release Engineering, DevOps, or infrastructure automation roles within cloud-centric environments.

    Strong experience deploying to AWS and Azure using infrastructure-as-code tools (Terraform, Ansible).

    Skilled in scripting and automation with Python, Bash, and PowerShell.

    Experience building and managing CI/CD pipelines using GitHub Actions, Jenkins, or Azure DevOps.

    Familiarity with Kubernetes and modern container-based deployment models.

    Strong understanding of change management practices, rollback strategies, and release governance.

    Comfortable collaborating across engineering, operations, and security disciplines.

    Experience supporting infrastructure for applications with differing release frequencies and lifecycle models.

    Preferred Qualifications

    Exposure to GitOps practices and policy-based deployments.

    Experience building dashboards or reports using Grafana, Power BI, or similar tools.

    Knowledge of metrics such as MTTR, deployment frequency, and change failure rate, with experience using them to drive improvement.

    Familiarity with observability tooling (e.g., Prometheus, Grafana, ELK) and alert-driven operations.

    Ability to communicate clearly, translate operational needs into technical solutions, and contribute to a culture of learning and improvement.

    Key Responsibilities:

    Ensure production stability by proactively monitoring, troubleshooting, and resolving infrastructure and deployment issues across AWS and Azure environments.

    Support the availability and reliability of hosted products through well-defined operational procedures, automation, and system hardening.

    Implement and manage infrastructure as code using Terraform and Ansible to ensure consistent, repeatable environments.

    Own and improve CI/CD pipelines to support frequent, safe, and auditable deployments of both infrastructure and application code.

    Coordinate and execute deployments in partnership with development teams, ensuring alignment with change management and release processes.

    Automate build, packaging, and deployment tasks to reduce manual work and eliminate release bottlenecks.

    Maintain observability tooling (monitoring, logging, alerting) to ensure critical systems are visible and actionable.

    Participate in incident response and postmortem processes, helping to identify and fix root causes and prevent recurrence.

    Document operational processes, runbooks, and readiness checklists to support service ownership and onboarding.

    Collaborate with engineering, security, and architecture teams to support platform and service readiness for new products.

    Contribute to improving production maintenance workflows, reducing change failure rates, and increasing mean time between failures (MTBF).


    Function: Cloud Operations

    Country: United States

    Office Location: Atlanta

    Workplace Type: Hybrid

    Advert

    Working at Sage means you’re supporting millions of small and medium-sized businesses globally with technology to work faster and smarter. We leverage the future of AI, meaning business owners spend less time doing routine tasks, like entering invoices and generating reports, and more time pursuing their ambitions.

    Our colleagues are the best of the best. Because to achieve extraordinary outcomes, we need extraordinary teams. This means infusing Sage with people who knock down barriers, continuously innovate, and want to experience their potential.

    Learn more about working at Sage: sage.com/en-us/company/careers/working-at-sage/

    Watch a video about our culture: youtube.com/watch?v=h1-vs3zIpnc

    We celebrate individuality and welcome you to join us if you embrace all backgrounds, identities, beliefs, and ways of working. If you need support applying, reach out at [email protected].

    Learn more about DEI at Sage: sage.com/en-us/company/careers/diversity-equity-and-inclusion/

    Equal Employment Opportunity (EEO)

    Sage is committed to Equal Employment Opportunity and providing reasonable accommodations to applicants with physical and/or mental disabilities.

    In order to provide equal employment and advancement opportunities to all individuals, employment decisions at Sage will be based on merit, qualifications, and abilities. Sage does not discriminate in employment opportunities or practices on the basis of race, color, religion, sex, national origin, age, protected disability, veteran status, sexual orientation, gender identity, genetic information, or any other characteristic protected by applicable law.



© 2025 Alerted.org