- LinkedIn (Mountain View, CA)
- …PyTorch, DeepSpeed, GNNs, Flash Attention. PyTorch Lightning and more and more. Model Serving Infrastructure: this team builds low latency high performance ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more
- DoorDash (San Francisco, CA)
- …Search. About the Role We're looking for a Staff Software Engineer with deep expertise in ML model serving to drive the next generation of our inference ... modern inference optimizations into production - Operationalize advances from the ML serving ecosystem (eg efficient caching, attention optimizations, batching,… more
- Oracle (Sacramento, CA)
- …Cloud's AI Infra offerings + Design and implement scalable orchestration for serving and training AI/ ML models, Model Parallelism & Performance across ... optimizing large-scale distributed training/inference workloads + Have deep understanding of AI/ ML workflows, encompassing data processing, model training, and… more
- LinkedIn (Mountain View, CA)
- …the Feature Store, and serving feature data with high performance. Model Serving Infrastructure: this team builds low latency high performance applications ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more
- LinkedIn (Mountain View, CA)
- …the Feature Store, and serving feature data with high performance. Model Serving Infrastructure: this team builds low latency high performance applications ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more
- MongoDB (Palo Alto, CA)
- …routing, and model health monitoring + Collaborate with peers across ML , infra , and product teams to define architectural patterns and operational ... and low latency at scale + Guide decisions on model serving architecture using tools like vLLM,...or retrieval-augmented generation (RAG) + Contributions to relevant open-source ML serving infrastructure + 1+ years of… more
- Insight Global (San Jose, CA)
- …KV-cache tuning, and using efficient attention mechanisms like Flash Attention. Scalable Model Serving : Understanding of how to deploy models at scale, ... Privacy Policy: https://insightglobal.com/workforce-privacy-policy/. Skills and Requirements *3-5 years in ML /AI engineering roles owning training and/or serving … more
- Oracle (Sacramento, CA)
- …knowledge of IaaS/PaaS industry and competitive capabilities. Experience with popular model training and serving frameworks like KServe, KubeFlow, Triton ... Transformers). + Experience in diagnosing, fixing, and resolving issues in AI model training and serving . **Responsibilities** **Responsibilities** As part of… more
- LinkedIn (Mountain View, CA)
- … infra and platform teams to optimize retrieval and serving efficiency, including embedding optimization, adaptive caching, and parameter-efficient fine-tuning. + ... the company. Our team works on a wide range of cutting-edge ML : LLM fine tuning, text generation, LLM-as-a-judge, prompt engineering, embedding-based retrieval, and… more
- Walmart (Sunnyvale, CA)
- …delivery** . + Drive innovation in **auction algorithms, dynamic bidding, contextual relevance, and ML model integration** . + Ensure the ad server meets the ... Data Science, and Infrastructure teams to shape the next generation of **high-performance, ML -driven ad serving systems** , setting new standards for **latency,… more