- General Motors (Warren, MI)
- …**About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure Platforms. Our team owns the cloud-agnostic, ... reliable, and cost-efficient platform that powers GM's AI efforts. We're proud to serve as the ...the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is...deep learning, enabling breakthroughs in areas like LLM, Generative AI , Recommenders and Vision that have put DL into… more
- US Bank (Irving, TX)
- …deployments with MLOps tools such as MLflow, Kubeflow, and Airflow. **Generative‑ AI Enablement** Deploy, fine‑tune, prompt‑ engineer , and scale large language ... and product teams. **Continuous Learning & Innovation** Stay current with cutting‑edge research and integrate state‑of‑the‑art AI /ML techniques whenever they add… more
- MongoDB (Palo Alto, CA)
- …the world's most popular developer data platform + Collaborate with ML experts from Voyage. ai to bring cutting-edge research into production at scale + Solve ... We're looking for a Lead Engineer , Inference Platform to join our team building...for embedding models that power semantic search, retrieval, and AI -native features across MongoDB Atlas. This role is part… more
- Red Hat (Boston, MA)
- …Summary:** The Red Hat Ecosystems Engineering group is seeking a Senior Principal Software Engineer in our Boston, MA office. In this role, you will work with a ... diverse team of highly motivated engineers on designing and implementing AI /ML workflows and solutions and integrating Partners solutions. You will also be working… more
- Red Hat (Raleigh, NC)
- …audiences, at conferences, and through blogs. + Develop and adapt quickly to a modern AI technology stack, including vLLM , llm-d, Agentic AI , LLMs, PyTorch, ... team, you will play a key role in shaping the future of AI -influenced software and platform architectures. You'll develop and deliver capabilities that support Red… more
- Palo Alto Networks (Santa Clara, CA)
- …Prisma AIRS, Palo Alto Networks is building the world's most comprehensive AI security platform. Organizations are increasingly building complex ecosystems of AI ... cannot address. In response, Prisma AIRS delivers model security, posture management, AI red teaming, and runtime protection. Our customers can confidently deploy … more
- NVIDIA (Santa Clara, CA)
- …design to keep pace + Collaborate across the company to guide the direction of AI Inferencing, working with software, research and product teams What we need to ... We are now looking for a Principal Software Engineer , TensorRT-LLM ! NVIDIA is hiring experienced principal...world are using GPUs to power a revolution in AI , enabling breakthroughs in areas like content creation, code… more
- NVIDIA (Santa Clara, CA)
- …and finetuning with mixed precision recipes on next-gen NVIDIA GPU architectures. + Research , prototype, and develop robust and scalable AI tools and pipelines. ... highly optimized solutions. What you'll be doing: + Develop algorithms for AI /DL, data analytics, machine learning, or scientific computing + Contribute and advance… more
- NVIDIA (Santa Clara, CA)
- …and finetuning with mixed precision recipes on next-gen NVIDIA GPU architectures. + Research , prototype, and develop robust and scalable AI tools and pipelines. ... NVIDIA is looking for engineers for our core AI Frameworks (Megatron Core (https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/core) and NeMo Framework… more