- Databricks Inc. (San Francisco, CA)
- …our customers can use deep data insights to improve their business. Databricks' Model Serving product provides enterprises with a unified, scalable, and governed ... inference, governance, monitoring, and lineage. As AI adoption accelerates, Model Serving is a core pillar of...with strong SLAs and cost efficiency. As a Senior Engineer , you'll play a critical role in shaping both… more
- Databricks Inc. (San Francisco, CA)
- …our customers can use deep data insights to improve their business. Databricks' Model Serving product provides enterprises with a unified, scalable, and governed ... inference, governance, monitoring, and lineage. As AI adoption accelerates, Model Serving is a core pillar of...with strong SLAs and cost efficiency. As a Staff Engineer , you'll play a critical role in shaping both… more
- Cerebras (Palo Alto, CA)
- …As part of this role, you will: Develop a highly available service for ML model serving . Enhance Ray Serve and our other libraries to simplify the development ... to democratize distributed computing and make it accessible to software developers of all skill levels. We're commercializing Ray,...savings. Optimize latency and throughput for both single- and multi- model serving scenarios. We'd love to hear… more
- Menlo Ventures (San Francisco, CA)
- …platform so our customers can use deep data insights to improve their business. Foundation Model Serving is the API Product for hosting and serving frontier ... LLM APIs and runtimes at scale. As a Staff Engineer , you'll play a critical role in shaping both...implement core systems and APIs that power Databricks Foundation Model Serving , ensuring scalability, reliability, and operational… more
- Databricks Inc. (San Francisco, CA)
- …platform so our customers can use deep data insights to improve their business. Foundation Model Serving is the API Product for hosting and serving frontier ... LLM APIs and runtimes at scale. As a Staff Engineer , you'll play a critical role in shaping both...implement core systems and APIs that power Databricks Foundation Model Serving , ensuring scalability, reliability, and operational… more
- Apple Inc. (San Francisco, CA)
- Senior Software Engineer , Model Inference San Francisco Bay Area, California, United States Software and Services Join Apple Maps to help build the best ... deliver measurable results at global scale. Description As a Software Engineer on the Apple Maps team,...like PyTorch, TensorFlow, and Hugging Face Transformers. Experience with model serving tools (eg, NVIDIA Triton, TensorFlow… more
- Amazon (San Francisco, CA)
- Senior Software Development Engineer , AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... lifecycles along with work experience on some optimizations for improving the model execution. - Software development experience in C++, Python (experience… more
- Amazon (San Francisco, CA)
- Software Development Engineer , AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... lifecycles along with work experience on some optimizations for improving the model execution. Software development experience in C++, Python (experience in… more
- Baseten (San Francisco, CA)
- …API endpoints for the latest open‑source models. This work spans distributed systems, model serving , and developer experience. You'll join a small, high‑impact ... authentication. Collaborate closely with other teams to deliver robust, developer‑friendly model serving experiences. REQUIREMENTS 3+ years experience building… more
- Amazon (Cupertino, CA)
- …Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a particular focus on large-scale generative AI ... Software Development Engineer AI/ML, Inference Serving , AWS Neuron...resilient AI infrastructure at AWS. We focus on developing model -agnostic inference innovations, including disaggregated serving , distributed… more