- Amazon (Cupertino, CA)
- Software Development Engineer, AI/ML, Inference Serving, AWS Neuron. AWS Neuron is the software stack powering AWS Inferentia and Trainium machine ... learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure ... and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect…
- Menlo Ventures (San Francisco, CA)
- …and contribute to our innovative projects. Position Overview: We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge ... capabilities of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our…
- Amazon (Seattle, WA)
- …cloud-scale machine-learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... Overview: AWS Neuron is the complete software stack for the AWS Inferentia and Trainium ... and performance optimization of core building blocks of LLM Inference: Attention, MLP, Quantization, Speculative Decoding, Mixture of…
- jobr.pro (Sunnyvale, CA)
- …UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google's needs with ... Large Language Models (LLM) and other Machine Learning (ML) models for inference. Experience building GPU-related software. Experience with compilers or ML…
- Amazon (San Francisco, CA)
- Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference. Job ID: 3067759 | Amazon.com Services LLC. The Annapurna Labs team at Amazon Web ... Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and ... scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your…
- Amazon (San Francisco, CA)
- Software Development Engineer, AI/ML, AWS Neuron, Model Inference. The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... of applied scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work will…
- Amazon (San Francisco, CA)
- …technology company in Herndon, Virginia is seeking a Senior Software Development Engineer to work on AI/ML projects. You will design and optimize machine ... learning models for deployment on custom hardware accelerators, ensuring maximum performance. Ideal candidates will have over 5 years of experience, strong Python and C++ skills, and knowledge in machine learning principles. This role fosters a collaborative…
- Amazon (San Francisco, CA)
- A leading e-commerce platform in San Francisco is seeking a Software Development Engineer to develop and optimize machine learning models for custom hardware ... accelerators. This role involves performance tuning, debugging, and close collaboration with customers to enhance their models on AWS's services. The ideal candidate has strong programming skills in C++ and Python, along with a solid understanding of machine…
- Capital One (Fredericksburg, VA)
- Lead AI Engineer (FM Hosting, LLM Inference). Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For ... customers interact with Capital One. Design, develop, test, deploy, and support AI software components including foundation model training, large language model…
- quadric.io, Inc (Burlingame, CA)
- …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world ... general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network ... AI/LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric…