"Alerted.org

Job Title, Industry, Employer
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Advanced Search

Advanced Search

Cancel
Remove
+ Add search criteria
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Related to

  • SiteOps Data Center Production Operations Engineer

    Meta (Henrico, VA)



    Apply Now

    Summary:

    Meta is seeking a forward thinking experienced engineer to join the Production Operations team within our Data Centers. These Data Centers are the foundation upon which our rapidly scaling infrastructure efficiently operates and upon which our innovative services are delivered. Meta is at the leading edge of the global data center industry both in terms of how data centers are designed and operated. This person should enjoy working in a fast paced, technical environment where adaptability and flexibility will be key to their success. We seek an IT professional with advanced, hands-on technical skills in server hardware and Linux - ideally in a Data Center environment. Having broad knowledge of server administration and participating in projects in a large-scale distributed data center environment is a core competency of this individual. The candidate should also have working knowledge and experience in a few of the following core areas: Hardware repair, OS management, Tooling and Automation, Networking, or Technical Project Management.

    Required Skills:

    SiteOps Data Center Production Operations Engineer Responsibilities:

    1. Support platform health by successfully resolving and closing tickets, while addressing the overall issue (i.e. addressing root cause) including, but not limited to, remote troubleshooting and physical inspection of services in data halls

    2. Participate in root cause analysis of highly technical issues within the data center, ranging from automated tooling to hardware failures and network issues

    3. Collaborate with cross-functional teams on projects and initiatives related to topics such as process, hardware and automation

    4. Point of contact for the introduction of new platforms and hardware to the site, in collaboration with partners and global resources, accelerating the time it takes to bring these products to sustained mass production

    5. Use tools and data analysis effectively to identify issues. Take actions to communicate with all stakeholders appropriately and manage or escalate as needed

    6. Identify corrective actions of hardware issues, work with internal teams and vendors

    7. influence future design changes to ensure ease of serviceability

    8. Solve systemic hardware and/or software issues at scale using scripting, automation, and tooling to drive global resolution

    9. Continuously evaluate and identify areas for improvement in processes, tools, and systems to optimize efficiency and quality of repairs

    10. Use data analytics to drive maximum server up-time and utilization rates, understanding hardware failure rates and service level agreements

    11. Support and train team members to evaluate and identify better ways to resolve issues, and define updates to tools and processes

    12. Provide engineering support and be a go-to technical resource for the team, leadership, and cross-functional teams in operating and maintaining data center servers

    13. Maintain and update documentation i.e. procedures, runbooks and guides

    14. Build cross functional relationships and influence policies and procedures that improve global data center operations

    15. Participate in 24/7 on-call rotation

    16. Travel up to 15% of the time

    Minimum Qualifications:

    Minimum Qualifications:

    17. BS, BA or BEng in technical field or commensurate experience

    18. 5+ years of technical IT experience within an infrastructure environment, in a role such as Systems Administrator, DevOps Engineer, or Site Reliability Engineer

    19. Intermediate-level understanding in Linux (or equivalent OS) in a complex IT environment with the capacity to triage, debug, and troubleshoot server issues

    20. Hands-on experience and knowledge of server hardware and components, including storage

    21. Intermediate-level knowledge of the interdependencies of data center functions and technologies including electrical, cooling, structured cabling, security, and network

    22. Experience managing technical issues and driving to the root cause

    23. Experience participating in technical projects related to areas such as process improvement, technology, and/or automation

    24. Capacity to communicate effectively, in a clear and concise manner, appropriately tailoring messages to the audience

    25. Intermediate-level knowledge of technologies such as HTTP, DNS, RAID, and DHCP

    26. Experience in providing technical guidance to external vendors

    27. Experience in debugging, modifying and developing commonly used scripting or programming languages in at least one of these languages: Bash, PHP, Python, SQL, Rust, Go or Perl

    28. Knowledge of out-of-band/lights-out server communication methods, such as IPMI and serial console

    29. Experience using data and metrics to drive decisions

    Preferred Qualifications:

    Preferred Qualifications:

    30. Experience in fostering growth in others, and driving influence across all organizational levels.

    31. Experience in a large-scale data center environment.

    32. Six Sigma knowledge/certification.

    33. Experience with large-scale AI implementations.

    Public Compensation:

    $40.38/hour to $61.06/hour + bonus + equity + benefits

     

    **Industry:** Internet

    Equal Opportunity:

    Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

     

    Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].

     


    Apply Now



Recent Searches

  • fleet service agent ft (United States)
  • promotions associate part time (United States)
  • genesys engineer (United States)
  • sales consultant end user (United States)
[X] Clear History

Recent Jobs

  • SiteOps Data Center Production Operations Engineer
    Meta (Henrico, VA)
[X] Clear History

Account Login

Cancel
 
Forgot your password?

Not a member? Sign up

Sign Up

Cancel
 

Already have an account? Log in
Forgot your password?

Forgot your password?

Cancel
 
Enter the email associated with your account.

Already have an account? Sign in
Not a member? Sign up

© 2025 Alerted.org