"Alerted.org

Job Title, Industry, Employer
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Advanced Search

Advanced Search

Cancel
Remove
+ Add search criteria
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Related to

  • Principal Software Engineer, CoreAI

    Microsoft Corporation (Redmond, WA)



    Apply Now

    Overview

     

    CoreAI is at the forefront of Microsoft’s mission to redefine how software is built and experienced. We are responsible for building the foundational platforms, services, programming models, and developer experiences that power the next generation of applications using Generative AI. Our work enables developers and enterprises to harness the full potential of AI to create intelligent, adaptive, and transformative software.

     

    The AI Core Infrastructure team, part of AI Platform team in CoreAI Organization is responsible for large-scale, highly reliable and efficient GPU management infrastructure and the inference and training platforms that power all of Microsoft’s AI workloads, such as M365 CoPilot, Github CoPilot, Microsoft CoPilot, AI Foundry’s Inference and Fine-Tuning offering of OAI and OSS models, and many more.

     

    As a Principal Engineer on the Observability team, you’ll shape the architecture and strategy on how customers monitor, troubleshoot, and scale their AI training workloads. You’ll work across ML infrastructure, distributed systems, and observability to power large-scale pre-training, post-training, and fine-tuning on some of the world’s largest AI supercomputers.

     

    Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

     

    In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

    Responsibilities

    As the Principal engineer on the Observability team, your responsibilities include:

    + Set the roadmap and drive the execution of the Observability platform built for AI workloads at a supercomputer scale.

    + Deliver deep insights that empower customers to troubleshoot and optimize their large-scale AI workloads

    + Leverage production telemetry to influence next-generation infrastructure design, boosting efficiency, reliability, and performance

    + Mentor and guide engineering teams, elevating technical excellence and championing a customer-focused approach to system design.

    Qualifications

    Required Qualifications:

    + Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.

    + 6+ years of experience building or operating distributed systems, with a strong focus on reliability, scalability, and performance.

    + Proficiency in one or more programming languages such as C#, C++, Go, or Python.

    + Strong understanding of Docker, Kubernetes, scalable architectures, and automation for production systems.

    Other Requirements:

    + Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:

    + Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

    Preferred Qualifications:

    + Excellent analytical and problem-solving skills, with the ability to extract customer pain points, synthesize ambiguous requirements, and design clear, scalable solutions.

    + Expertise with distributed observability technologies (e.g., Prometheus, OpenTelemetry, Grafana) and 2+ years of experience designing or scaling telemetry pipelines for high-throughput production systems.

    + Advanced, hands-on experience with production ML systems.

     

    Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.

     

    Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

     

    https://careers.microsoft.com/us/en/us-corporate-pay

     

    Software Engineering IC6 - The typical base pay range for this role across the U.S. is USD $163,000 - $296,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $220,800 - $331,200 per year.

     

    Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

     

    https://careers.microsoft.com/us/en/us-corporate-pay

     

    This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

     

    Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations. (https://careers.microsoft.com/v2/global/en/accessibility.html)

     


    Apply Now



Recent Searches

  • Sales Engineer Development Program (United States)
  • Software Engineer Workday Integration (New York)
  • office coordinator hospital operations (United States)
  • Machine Learning Engineer (Alabama)
[X] Clear History

Recent Jobs

  • Principal Software Engineer, CoreAI
    Microsoft Corporation (Redmond, WA)
  • MICU RN III
    Catholic Health Initiatives (Houston, TX)
[X] Clear History

Account Login

Cancel
 
Forgot your password?

Not a member? Sign up

Sign Up

Cancel
 

Already have an account? Log in
Forgot your password?

Forgot your password?

Cancel
 
Enter the email associated with your account.

Already have an account? Sign in
Not a member? Sign up

© 2026 Alerted.org