Engineering Manager VLLM Inference Jobs | Alerted.org

Engineering Manager , vLLM…

Red Hat (Boston, MA)

…power of open-source LLMs and vLLM to every enterprise. Red Hat Inference engineering team accelerates AI for the enterprise and brings operational ... to build, optimize, and scale LLM deployments. As an Engineering Manager of the Machine Learning Engineering team focused on vLLM , you will be… more

Red Hat (08/27/25)
- Related Jobs
Senior Deep Learning Manager , LLM…

NVIDIA (Santa Clara, CA)

…will be responsible for managing a team that characterizes the latest LLMs and inference servers like TensorRT-LLM, vLLM , and SGLang to ensure that NVIDIA ... at the forefront of the generative AI revolution. The Inference Benchmarking (IB) team specifically focuses on advanced ...to deep learning software projects, such as PyTorch, TRT-LLM, vLLM , and SGLang to drive advancements in the field.… more

NVIDIA (09/02/25)
- Related Jobs
Product Manager - Inference

NVIDIA (Santa Clara, CA)

…evolving, with new acceleration algorithms, usecases, and deployment techniques. As a Product Manager for AI Platform Inference you will be responsible for ... strategy What we need to see: + Experience with Inference deployment and optimization software (ex. vLLM ,...+ BS or MS degree in Computer Science, Computer Engineering , or similar experience (or equivalent experience) + 5+… more

NVIDIA (06/13/25)
- Related Jobs
Manager , Software Engineering…

NVIDIA (Santa Clara, CA)

… Manager to lead the development for the Dynamo engineering team, NVIDIA's high-performance, low-latency inference platform for serving generative ... be doing: + Mentor, grow, and develop the Dynamo engineering team and be responsible for planning and execution...inference systems. + Experience with LLM frameworks like vLLM & TRT-LLM. NVIDIA is widely considered to be… more

NVIDIA (08/30/25)
- Related Jobs
Senior Manager , Data Science - GenAI…

Capital One (San Jose, CA)

Senior Manager , Data Science - GenAI Digital Assistant Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually ... multi-agentic workflow, domain specific conversational large language model tuning and inference optimization. **In this role, you will:** + Partner with a… more

Capital One (08/13/25)
- Related Jobs
Senior Machine Learning Engineer - Model Training…

Red Hat (Boston, MA)

…PEFT, etc.). Familiarity with distributed training frameworks (eg FSDP, DeepSpeed) and inference runtimes (eg vLLM ). Experience in open-source projects and ... to democratize AI with open source! Red Hat's Global Engineering Team is looking for a Senior Machine Learning...with production needs. This position reports directly to the Manager of AI Innovation. This position may require occasional… more

Red Hat (09/04/25)
- Related Jobs

"Alerted.org

Advanced Search

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?