- Amazon (Seattle, WA)
- …designed by Annapurna Labs inside AWS. The Neuron SDK consists of a compiler , runtime, frameworks, and tooling customers need. It's also preinstalled in AWS Deep ... Learning AMIs and Deep Learning Containers for customers to quickly get started with running high performance and cost-effective inference and training. The Neuron team is hiring senior Runtime Software Development Engineers with a background in machine… more
- Microsoft Corporation (Redmond, WA)
- …or app level components. + Demonstrated mastery in ML compiler design, hardware-aware optimizations, and scalable infrastructure across heterogeneous platforms. ... Software Engineering IC6 - The typical base pay range for this role across the US is USD $163,000 - $296,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and… more
- Pacific Northwest National Laboratory (Richland, WA)
- …experience developing and implementing quantum algorithms and contributing to compiler design for quantum applications + Practical background in superconducting ... quantum device calibration, characterization, and maintenance, ensuring high-fidelity system performance + Strong foundation in quantum noise models and/or the design and application of quantum error correction codes, with emphasis on reliability and… more
- Amazon (Seattle, WA)
- …of Experts, etc. The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices ... across a range of models such as Llama 3.3 70B, 3.1 405B, DBRX, Mixtral, and so on. Key job responsibilities Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as… more
- Amazon (Redmond, WA)
- …and optimization - Back end tool experiences a plus, including Fusion Compiler , PrimeTime, or equivalent - Strong written and verbal communication skills Amazon ... is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other… more
- Amazon (Seattle, WA)
- …of a vertically integrated system stack consisting of the PyTorch inference library, Neuron compiler , runtime and collectives. A day in the life You will work with ... your senior management and technical leaders to define the building blocks for the latest LLMs, build and deliver them to customers. You will manage changing priorities as new models and new technologies emerge, and you adapt your team's work to manage them.… more
- Amazon (Seattle, WA)
- …The Inference Model Enablement and Generality team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference ... solutions with Trainium and Inferentia. Experience optimizing LLM inference performance for both latency and throughput is highly desired. Experience with distributed inference libraries such as vLLM is a bonus. Key job responsibilities This role will help… more
- Amazon (Bellevue, WA)
- …multimodal models for resource-efficient deployment * Work closely with compiler engineers, hardware architects, data collection, and product teams A ... day in the life As an Applied Scientist with the Silicon and Solutions Group Edge AI team, you'll contribute to science solution design, conduct experiments, explore new algorithms, develop embedded inference pipelines, and discover ways to enrich our customer… more