-
Systems Validation Manager, Annapurna Labs
- Amazon (Austin, TX)
-
Description
AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for customers who require specialized security solutions for their cloud services.
Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.
In Annapurna Labs we are at the forefront of hardware/software co-design not just in Amazon Web Services (AWS) but across the industry. The MLA Technology team is looking for candidates interested being at the forefront of new technology validation and Product Introduction. The scope of this role includes the complete vertical stack of Silicon, PCB, High Speed components e.g., HBM, PCIe and Chip to Chip, inter-systems and system to system. Diving deep into new technology hardware components and and scaling technologies that power our Machine Learning boards and servers at scale.
You’ll provide leadership in the application of new technologies to large scale deployments in a continuous effort to deliver a world-class customer experience. This is a fast-paced, intellectually challenging position, and you’ll work with thought-leaders in multiple technology areas. You’ll have high standards for yourself and everyone you work with, and you’ll be constantly looking for ways to improve our products' performance, quality and cost. We’re changing an industry, and we want individuals who are ready for this challenge and want to reach beyond what is possible today.
AWS is the world’s leading and most trusted provider of virtualized public cloud utility services. We offer our global IT customer base who span private, corporate and government sectors, over 100 fully featured, integrated services in Gen AI, compute, storage, database, analytics, mobile, Internet of Things (IOT) and enterprise applications. AWS operates a worldwide fleet of interconnected enterprise data centers at hyperscale to deliver the capacity that powers our customers IT infrastructure which enables their ability to concentrate on core competencies through agility and operational efficiency. To learn more about AWS, visit https://aws.amazon.com
Key job responsibilities
As a Validation manager, you are responsible for validating the chip and system architecture for the next generation Machine Learning Acceleration (MLA) product family. You work closely with design, software & architecture teams and develop in-depth chip and system validation plans, execute on the validation plans, debug and take the MLA product to successful production ramp. This includes validating the design to achieve the best performance, identify bottlenecks and work with design/architecture teams to resolve them. Our work does not stop at production ramp, we continue to support our servers in the datacenter through their lifecycle.
* Scale and manage a team of System Validation Engineers
* Architect validation strategies
* Develop validation methodologies and infrastructures for validating neural networks
* Evaluate and improve performance of Machine Learning systems
* Work with other functional teams to ensure delivery of high quality systems to AWS data centers
* Own validation schedules and reporting of progress metrics
* Provide technical mentoring for the team.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and leadership development. We care about your career growth and strive to assign projects that help our team members develop your leadership and technical expertise so you feel empowered to take on more complex tasks in the future.
Basic Qualifications
- 3+ years of engineering team management experience
- 7+ years of working directly within engineering teams experience
- 3+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
- 8+ years of leading the definition and development of multi tier web services experience
- Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
- Experience partnering with product or program management teams
Preferred Qualifications
- Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy
- Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers to improve their skills, and make them more effective, product software engineers
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
-
Recent Jobs
-
Systems Validation Manager, Annapurna Labs
- Amazon (Austin, TX)
-
FlexPLM Senior Programmer Analyst
- L. L. Bean, Inc. (Freeport, ME)