ML Model Serving Infra Jobs in California | Alerted.org

Sr. Staff Software Engineer, AI Infra

LinkedIn (Mountain View, CA)

…PyTorch, DeepSpeed, GNNs, Flash Attention. PyTorch Lightning and more and more. Model Serving Infrastructure: this team builds low latency high performance ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more

LinkedIn (12/27/25)

Staff Software Engineer, ML Serving…

DoorDash (San Francisco, CA)

…Search. About the Role: We're looking for a Staff Software Engineer with deep expertise in ML model serving to drive the next generation of our inference ... modern inference optimizations into production - Operationalize advances from the ML serving ecosystem (e.g., efficient caching, attention optimizations, batching,… more

DoorDash (11/24/25)

Sr Principal AI Software Engineer - ML & AI…

Oracle (Sacramento, CA)

…Cloud's AI Infra offerings + Design and implement scalable orchestration for serving and training AI/ML models, Model Parallelism & Performance across ... optimizing large-scale distributed training/inference workloads + Have deep understanding of AI/ML workflows, encompassing data processing, model training, and… more

Oracle (11/25/25)

Senior Software Engineer, AI Platform

LinkedIn (Mountain View, CA)

…the Feature Store, and serving feature data with high performance. Model Serving Infrastructure: this team builds low latency high performance applications ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more

LinkedIn (12/05/25)

Software Engineer, AI Platform

LinkedIn (Mountain View, CA)

…the Feature Store, and serving feature data with high performance. Model Serving Infrastructure: this team builds low latency high performance applications ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more

LinkedIn (10/21/25)

Lead Engineer, Inference Platform

MongoDB (Palo Alto, CA)

…routing, and model health monitoring + Collaborate with peers across ML, infra, and product teams to define architectural patterns and operational ... and low latency at scale + Guide decisions on model serving architecture using tools like vLLM,...or retrieval-augmented generation (RAG) + Contributions to relevant open-source ML serving infrastructure + 1+ years of… more

MongoDB (12/27/25)

Machine Learning Engineer

Insight Global (San Jose, CA)

…KV-cache tuning, and using efficient attention mechanisms like Flash Attention. Scalable Model Serving: Understanding of how to deploy models at scale, ... Privacy Policy: https://insightglobal.com/workforce-privacy-policy/. Skills and Requirements: 3-5 years in ML/AI engineering roles owning training and/or serving… more

Insight Global (10/16/25)

Principal Cloud Architect, AI Computational Data…

Oracle (Sacramento, CA)

…knowledge of IaaS/PaaS industry and competitive capabilities. Experience with popular model training and serving frameworks like KServe, KubeFlow, Triton ... Transformers). + Experience in diagnosing, fixing, and resolving issues in AI model training and serving. Responsibilities: As part of… more

Oracle (11/25/25)

Senior AI Engineering Manager, Enterprise AI

LinkedIn (Mountain View, CA)

…infra and platform teams to optimize retrieval and serving efficiency, including embedding optimization, adaptive caching, and parameter-efficient fine-tuning. + ... the company. Our team works on a wide range of cutting-edge ML: LLM fine tuning, text generation, LLM-as-a-judge, prompt engineering, embedding-based retrieval, and… more

LinkedIn (12/17/25)

Senior Director, Software Engineering

Walmart (Sunnyvale, CA)

…delivery. + Drive innovation in auction algorithms, dynamic bidding, contextual relevance, and ML model integration. + Ensure the ad server meets the ... Data Science, and Infrastructure teams to shape the next generation of high-performance, ML-driven ad serving systems, setting new standards for latency,… more

Walmart (11/11/25)
