• DatologyAI (Redwood City, CA)
    …looking for an engineer with deep experience building and operating large-scale training and inference systems. You will design, implement, and maintain the ... researchers to productionize new models and features quickly and safely. Optimize training and inference pipelines for performance, reliability, and cost. Ensure… more
    job goal (01/13/26)
    - Related Jobs
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer , AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference more
    job goal (01/13/26)
    - Related Jobs
  • Menlo Ventures (San Francisco, CA)
    About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more
    job goal (01/13/26)
    - Related Jobs
  • Amazon (San Francisco, CA)
    Software Development Engineer , AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
    job goal (01/12/26)
    - Related Jobs
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Deep Learning Software Engineer , Inference page is loaded## Senior Deep Learning Software Engineer , Inferencelocations: US, CA, Santa Clara: ... requisition id: JR2002670NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for...and Python experience is a plus.* Prior experience with training , deploying or optimizing the inference of… more
    job goal (01/13/26)
    - Related Jobs
  • OpenAI (San Francisco, CA)
    …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We're looking for a senior engineer to design and build the load balancer that… more
    job goal (01/13/26)
    - Related Jobs
  • OpenAI (San Francisco, CA)
    …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...work is inherently cross-functional: you'll collaborate directly with researchers training these models and with product teams defining new… more
    job goal (01/13/26)
    - Related Jobs
  • Google Inc. (Sunnyvale, CA)
    Software Engineer III, Infrastructure, Inference Control Plane corporate_fare Google place Sunnyvale, CA, USA Apply Bachelor's degree or equivalent practical ... goes on and is growing every day. As a software engineer , you will work on a...push technology forward. The mission of Vertex AI Online Inference Infrastructure team is to build a model serving… more
    job goal (01/13/26)
    - Related Jobs
  • jobr.pro (Sunnyvale, CA)
    …UI design and mobile; the list goes on and is growing every day. As a software engineer , you will work on a specific project critical to Google's needs with ... Large Language Models (LLM) and other Machine Learning (ML) models for inference . Experience building GPU-related software . Experience with compilers or ML… more
    job goal (01/13/26)
    - Related Jobs
  • quadric.io, Inc (Burlingame, CA)
    …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network...models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port… more
    job goal (01/13/26)
    - Related Jobs