ML Engineer vLLM Inference Jobs

42 jobs (page 1)

Senior Principal Machine Learning Engineer…

Red Hat (Boston, MA)

…is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and ... optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM (https://github.com/ vllm...components in Go and/or Rust to integrate with the vLLM project and manage distributed inference workloads.…

Red Hat (01/08/26)

Machine Learning Engineer, vLLM…

Red Hat (Raleigh, NC)

…open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise ... optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM, you will be...you. Join us in shaping the future of AI Inference! **What You Will Do** + Write robust Python…

Red Hat (12/31/25)

Senior Principal Machine Learning Engineer…

Red Hat (Boston, MA)

…is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and ... project (https://github.blog/news-insights/octoverse/octoverse-a-new-developer-joins-github-every-second-as-ai-leads-typescript-to-1/#the-top-open-source-projects-by-contributors) on GitHub. As a Machine Learning Engineer focused on vLLM, you will…

Red Hat (01/08/26)

Senior Software Engineer - vLLM…

Red Hat (Boston, MA)

…is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and ... to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model...scale LLM deployments. We are seeking an experienced Senior ML Ops engineer to work closely with…

Red Hat (12/06/25)

Staff ML Engineer, Inference…

General Motors (Sunnyvale, CA)

…Python, C++ or other relevant coding languages. + Expertise in ML inference, model serving frameworks (Triton, Ray Serve, vLLM, etc.). + Strong communication ... is eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the...efficiency. **About the Role:** We are seeking a Staff ML Infrastructure engineer to help build and…

General Motors (10/21/25)

Senior Software Development Engineer, AI/…

Amazon (Seattle, WA)

…integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... collaborate across teams to develop innovative optimization techniques * Build online/offline inference serving with vLLM, SGLang, TensorRT or similar platforms…

Amazon (01/06/26)

Software Development Engineer - AI/…

Amazon (Seattle, WA)

…integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... with syntax and tile-level semantics similar to Triton. - Experience with online/offline inference serving with vLLM, SGLang, TensorRT or similar platforms in…

Amazon (12/31/25)

Software Development Engineer AI/ML…

Amazon (Cupertino, CA)

…the boundaries of what's possible in large-scale ML serving. Recent shares: https://github.com/aws-neuron/upstreaming-to-vllm/releases/tag/2.25.0 ... - Master's degree in computer science or equivalent - Deep expertise in ML Frameworks/Libraries such as JAX, PyTorch, vLLM, SGLang, Dynamo, TorchXLA, TensorRT.…

Amazon (12/21/25)

Senior Software Engineer, AI…

NVIDIA (Santa Clara, CA)

…and passionate about performance engineering in ML frameworks (e.g., PyTorch) and inference engines (e.g., vLLM and SGLang). + Familiarity with GPU programming ... latest NVIDIA GPU hardware features; profile and optimize the inference framework (vLLM) with methods like speculative...building and optimizing LLM inference engines (e.g., vLLM, SGLang). + Hands-on work with ML…

NVIDIA (01/10/26)

Lead Engineer, Inference Platform

MongoDB (Palo Alto, CA)

…in multi-tenant environments + 1+ years of experience serving as TL for a large-scale ML inference or training platform SW project **Nice to Have** + Prior ... We're looking for a Lead Engineer, Inference Platform to join our...of experience in managing a technical team focused on ML inference or training infrastructure **Why Join…

MongoDB (12/27/25)