• Sr. Staff Software Engineer, AI Infra

    LinkedIn (Mountain View, CA)
    …PyTorch, DeepSpeed, GNNs, Flash Attention. PyTorch Lightning and more and more. Model Serving Infrastructure: this team builds low latency high performance ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more
    LinkedIn (12/27/25)
    - Related Jobs
  • Staff Software Engineer, ML Serving

    DoorDash (San Francisco, CA)
    …Search. About the Role We're looking for a Staff Software Engineer with deep expertise in ML model serving to drive the next generation of our inference ... modern inference optimizations into production - Operationalize advances from the ML serving ecosystem (eg efficient caching, attention optimizations, batching,… more
    DoorDash (11/24/25)
    - Related Jobs
  • Sr Principal AI Software Engineer - ML & AI…

    Oracle (Sacramento, CA)
    …Cloud's AI Infra offerings + Design and implement scalable orchestration for serving and training AI/ ML models, Model Parallelism & Performance across ... optimizing large-scale distributed training/inference workloads + Have deep understanding of AI/ ML workflows, encompassing data processing, model training, and… more
    Oracle (11/25/25)
    - Related Jobs
  • Senior Software Engineer, AI Platform

    LinkedIn (Mountain View, CA)
    …the Feature Store, and serving feature data with high performance. Model Serving Infrastructure: this team builds low latency high performance applications ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more
    LinkedIn (12/05/25)
    - Related Jobs
  • Software Engineer, AI Platform

    LinkedIn (Mountain View, CA)
    …the Feature Store, and serving feature data with high performance. Model Serving Infrastructure: this team builds low latency high performance applications ... together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with...model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more
    LinkedIn (10/21/25)
    - Related Jobs
  • Lead Engineer, Inference Platform

    MongoDB (Palo Alto, CA)
    …routing, and model health monitoring + Collaborate with peers across ML , infra , and product teams to define architectural patterns and operational ... and low latency at scale + Guide decisions on model serving architecture using tools like vLLM,...or retrieval-augmented generation (RAG) + Contributions to relevant open-source ML serving infrastructure + 1+ years of… more
    MongoDB (12/27/25)
    - Related Jobs
  • Machine Learning Engineer

    Insight Global (San Jose, CA)
    …KV-cache tuning, and using efficient attention mechanisms like Flash Attention. Scalable Model Serving : Understanding of how to deploy models at scale, ... Privacy Policy: https://insightglobal.com/workforce-privacy-policy/. Skills and Requirements *3-5 years in ML /AI engineering roles owning training and/or serving more
    Insight Global (10/16/25)
    - Related Jobs
  • Principal Cloud Architect, AI Computational Data…

    Oracle (Sacramento, CA)
    …knowledge of IaaS/PaaS industry and competitive capabilities. Experience with popular model training and serving frameworks like KServe, KubeFlow, Triton ... Transformers). + Experience in diagnosing, fixing, and resolving issues in AI model training and serving . **Responsibilities** **Responsibilities** As part of… more
    Oracle (11/25/25)
    - Related Jobs
  • Senior AI Engineering Manager, Enterprise AI

    LinkedIn (Mountain View, CA)
    infra and platform teams to optimize retrieval and serving efficiency, including embedding optimization, adaptive caching, and parameter-efficient fine-tuning. + ... the company. Our team works on a wide range of cutting-edge ML : LLM fine tuning, text generation, LLM-as-a-judge, prompt engineering, embedding-based retrieval, and… more
    LinkedIn (12/17/25)
    - Related Jobs
  • Senior Director, Software Engineering

    Walmart (Sunnyvale, CA)
    …delivery** . + Drive innovation in **auction algorithms, dynamic bidding, contextual relevance, and ML model integration** . + Ensure the ad server meets the ... Data Science, and Infrastructure teams to shape the next generation of **high-performance, ML -driven ad serving systems** , setting new standards for **latency,… more
    Walmart (11/11/25)
    - Related Jobs