-
Software Development Engineer
- Amazon (Seattle, WA)
-
Description
We're enhancing the shopping experience on Amazon through the conversational capabilities of large language models, and we're looking for innovative professionals who are passionate about technology and customer experience. You'll have the opportunity to drive breakthrough innovations in LLM inference efficiency while working alongside talented scientists, engineers, and technical program managers (TPMs) to create solutions that serve our customers.
If you're excited about optimizing the computational heart of AI systems, collaborating with a dynamic team, and contributing to this evolving field, we'd love to have you join our mission to unlock unprecedented LLM performance!
Key job responsibilities
We're looking for an experienced Software Development Engineer with deep expertise in GPU/customized chip kernel optimization and ML inference acceleration to lead projects in architecting, designing, developing, and optimizing high-performance kernel implementations for large language model. You'll guide your team in creating and optimizing innovative kernels, custom operators, and low-level optimizations that maximize hardware utilization and minimize computational overhead.
In this role, you'll establish best practices for kernel development, memory management, and parallel computing that dramatically reduce inference latency and boost throughput for transformer-based models. You'll work with your team to develop kernel fusion techniques, attention mechanism optimizations, and matrix multiplication accelerations at scale, partnering with engineers and scientists in a fast-paced environment to deliver measurable performance gains. You'll also drive technical roadmap, performance benchmarking, and optimizations focused on kernel-level improvements.
Basic Qualifications
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- Experience with Machine and Deep Learning toolkits such as MXNet, TensorFlow, Caffe and PyTorch
- Experience with CUDA, cuDNN, cuBLAS and other GPU kernel-level optimization techniques
Preferred Qualifications
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Experience in Neuron hardware (Inferentia and Trainium chips) and NKI kernel optimization
- 3+ years of hands-on experience with CUDA programming and GPU kernel development
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $151,300/year in our lowest geographic market up to $261,500/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits . This position will remain posted until filled. Applicants should apply via our internal or external career site.
-
Recent Searches
- Junior Full Stack Developer (Alabama)
- Subassembly Tech 2nd Shift (Florida)
- Adjunct Assistant Professor Economics (United States)
- Senior Configuration Professional Workday (Michigan)
Recent Jobs
-
Software Development Engineer
- Amazon (Seattle, WA)
-
Analytical Engineer III
- Techtronic Industries North America, Inc. (Anderson, SC)
-
Senior Vice President, Data Strategy
- Publicis Groupe (New York, NY)