  • Databricks Inc. (San Francisco, CA)
    …our customers can use deep data insights to improve their business. Databricks' Model Serving product provides enterprises with a unified, scalable, and governed ... inference, governance, monitoring, and lineage. As AI adoption accelerates, Model Serving is a core pillar of ... with strong SLAs and cost efficiency. As a Senior Engineer, you'll play a critical role in shaping both…
    job goal (01/13/26)
  • Databricks Inc. (San Francisco, CA)
    …our customers can use deep data insights to improve their business. Databricks' Model Serving product provides enterprises with a unified, scalable, and governed ... inference, governance, monitoring, and lineage. As AI adoption accelerates, Model Serving is a core pillar of ... with strong SLAs and cost efficiency. As a Staff Engineer, you'll play a critical role in shaping both…
    job goal (01/13/26)
  • Cerebras (Palo Alto, CA)
    …As part of this role, you will: Develop a highly available service for ML model serving. Enhance Ray Serve and our other libraries to simplify the development ... to democratize distributed computing and make it accessible to software developers of all skill levels. We're commercializing Ray, ... savings. Optimize latency and throughput for both single- and multi-model serving scenarios. We'd love to hear…
    job goal (01/13/26)
  • Menlo Ventures (San Francisco, CA)
    …platform so our customers can use deep data insights to improve their business. Foundation Model Serving is the API product for hosting and serving frontier ... LLM APIs and runtimes at scale. As a Staff Engineer, you'll play a critical role in shaping both ... implement core systems and APIs that power Databricks Foundation Model Serving, ensuring scalability, reliability, and operational…
    job goal (01/13/26)
  • Databricks Inc. (San Francisco, CA)
    …platform so our customers can use deep data insights to improve their business. Foundation Model Serving is the API product for hosting and serving frontier ... LLM APIs and runtimes at scale. As a Staff Engineer, you'll play a critical role in shaping both ... implement core systems and APIs that power Databricks Foundation Model Serving, ensuring scalability, reliability, and operational…
    job goal (01/12/26)
  • Clutch Canada (Atlanta, GA)
    …week. Overview: As Speechify expands, our AI team seeks a Senior Backend Engineer. This role is central to ensuring our infrastructure scales efficiently, optimizing ... key product flows, and constructing resilient end-to-end systems. If you are passionate about strategizing, enjoy fast-paced environments, and are eager to take ownership of product decisions, we'd love to hear from you. What You'll Do: State of the art voice…
    job goal (01/13/26)
  • Apple Inc. (San Francisco, CA)
    Senior Software Engineer, Model Inference | San Francisco Bay Area, California, United States | Software and Services. Join Apple Maps to help build the best ... deliver measurable results at global scale. Description: As a Software Engineer on the Apple Maps team, ... like PyTorch, TensorFlow, and Hugging Face Transformers. Experience with model serving tools (e.g., NVIDIA Triton, TensorFlow…
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference | Job ID: 3067759 | Amazon.com Services LLC. The Annapurna Labs team at Amazon Web ... lifecycles along with work experience on some optimizations for improving the model execution. - Software development experience in C++, Python (experience…
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Software Development Engineer, AI/ML, AWS Neuron, Model Inference. The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... lifecycles along with work experience on some optimizations for improving the model execution. Software development experience in C++, Python (experience in…
    job goal (01/12/26)
  • Baseten (San Francisco, CA)
    …API endpoints for the latest open‑source models. This work spans distributed systems, model serving, and developer experience. You'll join a small, high‑impact ... authentication. Collaborate closely with other teams to deliver robust, developer‑friendly model serving experiences. Requirements: 3+ years experience building…
    job goal (01/13/26)