- Amazon (Cupertino, CA)
- …collaborate across compiler , runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the ... used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK, developed… more
- Amazon (Cupertino, CA)
- …collaborate across compiler , runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the ... used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK, developed… more
- NVIDIA (Santa Clara, CA)
- …Flash Attention) + Expertise in inference engines like vLLM and SGLang + Expertise in machine learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU ... We are now looking for a Senior Deep Learning Software Engineer, FlashInfer....out from the crowd: + Background in domain specific compiler and library solutions for LLM inference and training… more
- Amazon (Cupertino, CA)
- Description The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference ... and JAX. Your role will involve working closely with our custom-built Machine Learning accelerators, including Inferentia and Trainium, which represent the… more
- Amazon (Cupertino, CA)
- …Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and servers that use them. This role is for ... a software engineer in the Machine Learning Inference Model Enablement team for...Inference Model Enablement team works side by side with compiler engineers and runtime engineers to create, build and… more
- NVIDIA (Santa Clara, CA)
- …alignment + Enable team collaboration across the company to guide the direction of machine learning , working with software, research and product teams. What we ... multi-functional teams. + Work with HW architecture teams, Deep Learning SW teams, Compiler and Infrastructure teams...post-Si chip development and verification. + Technical background with machine learning , deep learning , open… more
- Google (Sunnyvale, CA)
- …Google (https://careers.google.com/benefits/) . **Responsibilities** + Explore and define future Machine Learning (ML) accelerator system and chip architecture ... Senior Software Engineering Manager, TPU Performance _corporate_fare_ Google...of experience with one or more of the following: Speech /audio (eg, technology duplicating and responding to the human… more
- Amazon (Cupertino, CA)
- …Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. As a part of the Neuron Frameworks team ... for ML model developers. A successful candidate will have a experience developing Machine Learning infrastructure and/or ML Frameworks, a demonstrated ability to… more
- NVIDIA (Santa Clara, CA)
- …inference pipeline. + Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams + ... are using GPUs to power a revolution in deep learning -powered AI, enabling breakthroughs in areas like LLM, ChatGPT...Prior experience with a LLM framework or a DL compiler in inference, deployment, algorithms, or implementation + Prior… more
- Amazon (Cupertino, CA)
- …that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is responsible for ... software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and the Trn1 and...Training team works side by side with chip architects, compiler engineers and runtime engineers to create , build… more