"Alerted.org

Job Title, Industry, Employer
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Advanced Search

Advanced Search

Cancel
Remove
+ Add search criteria
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Related to

  • System Development Engineer II, Data Center…

    Amazon (Nashville, TN)



    Apply Now

    Description

    The Systems Development Engineer (SysDev) within Data Center Infrastructure Engineering (DCIE) is responsible for developing automation, monitoring, and analytics solutions to support Amazon’s global IT infrastructure within Fulfillment Centers (FCs). This role focuses on building scalable software systems, infrastructure automation, and data-driven insights to enhance the reliability, efficiency, and performance of Amazon’s power, cooling, and structured cabling infrastructure.

     

    SysDevs collaborate closely with DCIE Engineers (electrical, mechanical, telecom, and general), TPMs, and DCIO Engineers to develop custom tools, dashboards, APIs, and automation frameworks that improve real-time monitoring, predictive maintenance, and operational scalability.

     

    Key job responsibilities

     

    Infrastructure System Development & Automation

     

    - Develop custom automation tools to manage and optimize power, HVAC, and structured cabling infrastructure across Amazon FCs.

     

    - Build scalable APIs, microservices, and integration solutions to streamline infrastructure monitoring and control.

     

    - Create automated deployment frameworks for configuring, provisioning, and managing critical infrastructure components.

     

    - Collaborate with TPMs and DCIE Engineers to enhance infrastructure standardization through code-driven deployments.

     

    Monitoring & Data Analytics

     

    - Design and implement real-time dashboards for power monitoring, HVAC performance, and network infrastructure health.

     

    - Develop data analytics pipelines to process and analyze infrastructure telemetry, supporting predictive maintenance and anomaly detection.

     

    - Work with DCIM teams to integrate asset tracking, power consumption metrics, and infrastructure lifecycle insights.

     

    - Use machine learning and AI-driven analytics to identify trends, prevent failures, and optimize resource allocation.

     

    Operational Support & Incident Response

     

    - Automate incident detection, alerting, and response workflows to minimize downtime and improve infrastructure reliability.

     

    - Support Sev1/Sev2 incident response, developing automated troubleshooting and remediation tools for infrastructure failures.

     

    - Work with DCIO Engineers to enhance on-call operations through software-defined automation.

     

    - Participate in post-incident reviews (PIRs), implementing software-driven solutions to prevent recurrence.

     

    Infrastructure Integration & Standardization

     

    - Develop and maintain OpenDCIM-based systems to track power, cooling, and network infrastructure assets.

     

    - Create tools to enforce compliance with Amazon’s infrastructure standards, reducing manual audits and deployment errors.

     

    - Support integration of Fault-Managed Power (Project Constellation), Fiber Media Conversion (Project Opti-Bridge), and Split CT Power Monitoring solutions into automated workflows.

     

    Collaboration & Process Improvement

     

    - Partner with TPMs to define software solutions for infrastructure lifecycle management, remediation projects, and scalability initiatives.

     

    - Work with DCIE Engineers to implement self-healing infrastructure capabilities through automation and AI-driven insights.

     

    - Continuously improve developer operations (DevOps) and infrastructure automation practices within DCIE.

     

    A day in the life

     

    As an SysDev in DCIE, you’ll begin your day reviewing infrastructure telemetry dashboards and overnight logs to identify anomalies or performance degradations across Amazon Fulfillment Centers. You’ll then sync with Technical Program Managers (TPMs) and DCIE Engineers to prioritize active workstreams, whether it’s scaling power monitoring pipelines, deploying automation for HVAC lifecycle tracking, or integrating new telemetry sources into the DCIM platform. Midday, you’ll be coding—whether building out RESTful APIs, refining predictive maintenance models, or developing automation scripts for infrastructure deployment. You’ll participate in design reviews, contribute to PIRs, and resolve automation or telemetry-related blockers raised by on-call engineers. Your day ends with sprint planning or roadmap check-ins, driving progress on scalable, self-healing infrastructure systems that improve uptime, efficiency, and global visibility for thousands of Amazon sites.

     

    Amazon offers a full range of benefits that support you and eligible family members, including domestic partners and their children. Benefits can vary by location, the number of regularly scheduled hours you work, length of employment, and job status such as seasonal or temporary employment.

    The benefits that generally apply to regular, full-time employees include:

    - Medical, Dental, and Vision Coverage

     

    - Maternity and Parental Leave Options

     

    - Paid Time Off (PTO)

     

    - 401(k) Plan

     

    If you are not sure that every qualification on the list above describes you exactly, we'd still love to hear from you!

     

    At Amazon, we value people with unique backgrounds, experiences, and skillsets. If you’re passionate about this role and want to make an impact on a global scale, please apply!

     

    About the team

     

    The Data Center Infrastructure Engineering (DCIE) team within Ops Technology Infrastructure Engineering (OTIE) designs, standardizes, and sustains scalable, cost-effective, and resilient IT infrastructure for Amazon Fulfillment and Logistics Operations worldwide.

     

    We enable Operations Technology Solutions (OTS) by delivering high-performance power, cooling, structured cabling, edge compute, and automation solutions that ensure reliable and efficient on-premises hardware operations.

     

    Our work spans Demarcation Rooms, MDFs, IDFs, power systems (UPSs, ATSs, PDUs), fault-managed power, cooling and containment, Computers on Wheels (COWs), telecommunications, and distributed edge compute infrastructure to enhance data processing and reduce latency.

     

    Through automation, predictive analytics, and proactive maintenance, DCIE drives operational excellence, minimizes downtime, and scales infrastructure to support Amazon’s rapid growth while aligning with its efficiency, reliability & safety, sustainability, and scalability objectives.

    Basic Qualifications

    - Experience in automating, deploying, and supporting large-scale infrastructure

     

    - Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust

     

    - Experience with Linux/Unix

     

    - Experience with CI/CD pipelines build processes

     

    - 3+ years of experience in software development, systems engineering, or infrastructure automation.

     

    - Proficiency in Python, Java, Go, or another high-level programming language.

     

    - Experience developing RESTful APIs, microservices, and cloud-based infrastructure solutions.

     

    - Strong background in infrastructure automation (Terraform, Ansible, AWS CloudFormation, etc.).

     

    - Familiarity with database systems (SQL, NoSQL, or Time-Series DBs like InfluxDB, Prometheus, etc.).

     

    - Hands-on experience with monitoring and logging platforms (Grafana, Kibana, Splunk, AWS CloudWatch, etc.).

     

    - Experience with DevOps, CI/CD, and version control (Git, GitHub, Jenkins, etc.).

    Preferred Qualifications

    - Experience with distributed systems at scale

     

    - Experience with data center infrastructure monitoring, automation, and DCIM platforms.

     

    - Knowledge of power systems (UPS, ATS, PDU), HVAC, and structured cabling infrastructure.

     

    - Hands-on experience with IoT, telemetry data processing, and edge computing.

     

    - Experience with AI/ML applications for predictive maintenance and anomaly detection.

     

    - Background in network automation and SDN (Software-Defined Networking).

     

    Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

     

    Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

     


    Apply Now



Recent Searches

  • SAP Hana Data Modeler (Maryland)
  • instructional coach lbs1 required (United States)
  • Cyber Security Incident Analyst (Virginia)
  • animal support technician ii (United States)
[X] Clear History

Recent Jobs

  • System Development Engineer II, Data Center Infrastructure Engineering (Dcie)
    Amazon (Nashville, TN)
  • CMM Programmer / Inspector
    Marotta Controls, Inc. (Montville, NJ)
  • Host
    PF Changs (Mount Pleasant, SC)
[X] Clear History

Account Login

Cancel
 
Forgot your password?

Not a member? Sign up

Sign Up

Cancel
 

Already have an account? Log in
Forgot your password?

Forgot your password?

Cancel
 
Enter the email associated with your account.

Already have an account? Sign in
Not a member? Sign up

© 2025 Alerted.org