- Red Hat (Boston, MA)
- …platform for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM, you will be at the forefront of ... will do** + Write robust Python and C++, working on vLLM systems, high performance machine learning primitives, performance analysis and modeling, and…
- Red Hat (Boston, MA)
- …to build, optimize, and scale LLM deployments. As a Principal Machine Learning Engineer focused on distributed vLLM (http://github.com/vllm-project/) ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...and guide fellow engineers, fostering a culture of continuous learning and innovation. **What you will bring** + Strong…
- Red Hat (Boston, MA)
- …you will do** + Collaborate with research and product development teams to scale machine learning products for internal and external applications + Create and ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...learning products and software. As an ML Ops engineer, you will work closely with our technical and…
- Red Hat (Boston, MA)
- …a stable platform for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on llama.cpp, you will be at the ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...challenges in model performance and efficiency. Your work with machine learning and high performance computing will…
- NVIDIA (Santa Clara, CA)
- At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world's most challenging problems. We're ... inference backends and compilers for GPUs. + Knowledge of Machine Learning techniques and GPU programming with...Background in working with LLM inference frameworks like TensorRT-LLM, vLLM, SGLang. + Experience working with deep learning…
- Amazon (New York, NY)
- …talented, and inventive Senior Research Engineer with a strong hands-on machine learning background, to lead the development of industry-leading multimodal ... upon industry leading frameworks (NeMo, Megatron Core, PyTorch, Jax, vLLM, TRT, etc.) - Work with other team members...an engineering team - 2+ years of expertise in Machine Learning and/or Model Training. Preferred Qualifications…
- NVIDIA (Santa Clara, CA)
- …+ Expertise in inference engines like vLLM and SGLang + Expertise in machine learning compilers (e.g. Apache TVM, MLIR) + Strong experience in GPU kernel ... We are now looking for a Senior Deep Learning Software Engineer, FlashInfer. NVIDIA has...teams + Contributing to open source communities like FlashInfer, vLLM, and SGLang What we need to see: +…
- Amazon (Seattle, WA)
- …(Inf1/Inf2) our cloud-scale Machine Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: - The ... tools builders already love: PyTorch, JAX, and the rapidly evolving vLLM ecosystem. By weaving Neuron SDK deep into these...- Frameworks, Distributed Training, or Inference - to enhance machine learning capabilities on AWS's specialized AI…
- S&P Global (Cambridge, MA)
- …is looking for ML Engineer interns to join the group of Machine Learning Engineers working on developing a cutting-edge GenAI platform, LLM-powered ... Kensho is S&P Global's hub for AI innovation and transformation. With expertise in Machine Learning and data discovery, we develop and deploy novel solutions for…
- NVIDIA (Santa Clara, CA)
- …Computer Engineering, or a related field (or equivalent experience) + Experience in deep learning or applied machine learning + Strong foundation in deep ... the world. We are now looking for a Deep Learning Algorithms Engineer! We are seeking a...model optimization and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang. As NVIDIA makes inroads into the Datacenter…