Principal Hardware Quality Engineer
Microsoft Corporation (Redmond, WA)
Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and is responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for more than 200 Microsoft online businesses globally, including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive, and the Microsoft Azure platform, with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide. We are looking for passionate, high-energy engineers to help achieve that mission.
As Microsoft's cloud business continues to grow, the ability to deploy new offerings and hardware infrastructure on time, in high volume, with high quality, and at the lowest cost is of paramount importance. To achieve this goal, the Hardware, Infrastructure Management, and Fundamentals Engineering (HIFE) team is instrumental in defining and delivering operational measures of success for hardware manufacturing and in improving the planning process, quality, delivery, scale, and sustainability of Microsoft cloud hardware. We are looking for seasoned engineers with a passion for customer-focused solutions and the insight and industry knowledge to envision and implement future technical solutions that will manage and optimize the Cloud infrastructure.
We are looking for a **Principal Hardware Quality Engineer** to join the team.
Responsibilities
+ Perform hands-on debugging in data centers (onsite and virtual).
+ Develop and implement a robust supplier quality management strategy to ensure the data center hardware is manufactured at the highest level of quality standards.
+ Lead work across data centers, development teams, and suppliers to resolve critical and high-severity issues.
+ Conduct hands-on debugging in global data centers (onsite and virtual), including GPU sub-system failure analysis.
+ Drive the continuous improvement process based on Root Cause Analysis (RCA) and identified opportunities.
+ Manage multiple NPI builds and quality phase-gate deliverables for the manufacturing team throughout the engineering development lifecycle, from concept through production readiness.
+ Establish Critical-to-Quality performance metrics to measure and improve product quality.
+ Act as the voice of quality in the hardware change management process, ensuring quality requirements are considered and met.
Qualifications
Required Qualifications:
+ Doctorate Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 5+ years technical engineering experience
+ OR Master's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 7+ years technical engineering experience
+ OR Bachelor's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 8+ years technical engineering experience.
+ 8+ years of work experience in managing product quality in the electronics industry.
+ 5+ years of direct engineering experience in hardware system issue resolution for GPU servers.
+ 3+ years of experience with query languages such as SQL, applied to debug data such as telemetry and logs, to identify and investigate hardware failure signatures.
Other Requirements:
The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
+ Master’s degree in Electrical Engineering, Software Engineering, or System Engineering.
+ OR 12+ years equivalent experience.
+ Patent or track record of engineering excellence.
+ Experience with liquid cooling systems in data centers.
+ 12+ years of experience working with modern server architectures, including an understanding of GPU, GPU system hardware, memory, or CPU, and of methods for failure analysis, debugging, or validation.
+ 12+ years of proven success leading the resolution of critical quality issues across data centers.
+ 8+ years of system-level server debugging with an understanding of power, system, and network environments.
+ 3+ years of direct GPU-related engineering experience in issue debugging and test log review.
+ Leadership skills and ability to collaborate with diverse teams and drive a call to action.
Reliability Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: US corporate pay information | Microsoft Careers (https://careers.microsoft.com/v2/global/en/us-corporate-pay.html)
Microsoft will accept applications for the role until Oct 29th, 2025.
#azurehwjobs #HIFE #SCHIE
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html).