- Red Hat (Boston, MA)
- …power of open-source LLMs and vLLM to every enterprise. Red Hat Inference engineering team accelerates AI for the enterprise and brings operational ... to build, optimize, and scale LLM deployments. As an Engineering Manager of the Machine Learning Engineering team focused on vLLM , you will be… more
- NVIDIA (Santa Clara, CA)
- …will be responsible for managing a team that characterizes the latest LLMs and inference servers like TensorRT-LLM, vLLM , and SGLang to ensure that NVIDIA ... at the forefront of the generative AI revolution. The Inference Benchmarking (IB) team specifically focuses on advanced ...to deep learning software projects, such as PyTorch, TRT-LLM, vLLM , and SGLang to drive advancements in the field.… more
- NVIDIA (Santa Clara, CA)
- …evolving, with new acceleration algorithms, usecases, and deployment techniques. As a Product Manager for AI Platform Inference you will be responsible for ... strategy What we need to see: + Experience with Inference deployment and optimization software (ex. vLLM ,...+ BS or MS degree in Computer Science, Computer Engineering , or similar experience (or equivalent experience) + 5+… more
- NVIDIA (Santa Clara, CA)
- … Manager to lead the development for the Dynamo engineering team, NVIDIA's high-performance, low-latency inference platform for serving generative ... be doing: + Mentor, grow, and develop the Dynamo engineering team and be responsible for planning and execution...inference systems. + Experience with LLM frameworks like vLLM & TRT-LLM. NVIDIA is widely considered to be… more
- Capital One (San Jose, CA)
- Senior Manager , Data Science - GenAI Digital Assistant Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually ... multi-agentic workflow, domain specific conversational large language model tuning and inference optimization. **In this role, you will:** + Partner with a… more
- Red Hat (Boston, MA)
- …PEFT, etc.). Familiarity with distributed training frameworks (eg FSDP, DeepSpeed) and inference runtimes (eg vLLM ). Experience in open-source projects and ... to democratize AI with open source! Red Hat's Global Engineering Team is looking for a Senior Machine Learning...with production needs. This position reports directly to the Manager of AI Innovation. This position may require occasional… more