- Amazon (Cupertino, CA)
- …collaborate across compiler , runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the ... to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library… more
- NVIDIA (Santa Clara, CA)
- …Flash Attention) + Expertise in inference engines like vLLM and SGLang + Expertise in machine learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU ... We are now looking for a Senior Deep Learning Software Engineer , FlashInfer. NVIDIA has...out from the crowd: + Background in domain specific compiler and library solutions for LLM inference and training… more
- Amazon (Seattle, WA)
- …our cloud-scale Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distribute Training team for AWS Neuron, ... Training team works side by side with chip architects, compiler engineers and runtime engineers to create, build and...effectively within cross-functional teams, and a solid foundation in Machine Learning are critical for success in… more
- Amazon (Cupertino, CA)
- …our cloud-scale Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distribute Training team for AWS Neuron, ... Training team works side by side with chip architects, compiler engineers and runtime engineers to create, build and...effectively within cross-functional teams, and a solid foundation in Machine Learning are critical for success in… more
- Amazon (Cupertino, CA)
- …of talent, we have been able to improve AWS cloud infrastructure in high-performance machine learning with AWS Neuron, Inferentia and Trainium ML chips, in ... Tools team * Work closely with the frameworks and compiler teams. * Collect requirements from various other teams...is the software of Trainium and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML… more
- Amazon (Cupertino, CA)
- …and the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. ... Web Services (AWS) is looking for a Software Development Engineer II to build, deliver, and maintain complex products...Training team works side by side with chip architects, compiler engineers and runtime engineers to create , build… more
- Amazon (Seattle, WA)
- …and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. ... The Distributed CoreTech training team works side by side with chip architects, compiler engineers and runtime engineers to create , build and tune distributed… more
- Google (Sunnyvale, CA)
- …evaluating GPU systems for comparative analysis and benchmarking for Google's internal Machine Learning (ML) workloads. We strive for extracting maximum ... GPU Performance Engineer _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid**...either an academic or industry setting. + Experience with compiler improvement, code generation and runtime systems for GPU… more
- quadric.io, Inc (Burlingame, CA)
- …network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and ... conventional C++ DSP and control code. Role: The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels/operators to run efficiently… more
- Amazon (New York, NY)
- …of talent, we have been able to improve AWS cloud infrastructure in high-performance machine learning with AWS Neuron, Inferentia and Trainium ML chips, in ... is the software of Trainium and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML...AWS Neuron. Neuron is a Software that include ML compiler and native integration into popular ML frameworks. Our… more