- NVIDIA (Santa Clara, CA)
- …that scale from a handful to thousands of GPUs, supporting a variety of LLM frameworks (eg, TensorRT- LLM , vLLM, SGLang). + Disaggregated Serving: Architect and ... of disaggregated serving for Dynamo-supported inference engines (vLLM, SGLang, TRT- LLM , llama.cpp, mistral.rs). + Improve intelligent routing and KV-cache management… more
- CVS Health (Sacramento, CA)
- …retrieval pipelines, Snowflake compute (Snowpark), and integration with LLM -driven applications. This role will be utilizing their expertise ... for compute, storage, and API usage related to document analytics and LLM integration. + Produce technical documentation, runbooks, and clear explanations of model/… more
- Oracle (Sacramento, CA)
- …Technologist** who brings both depth and versatility across AI/ML solutions & LLM agents. A **people leader** who can inspire, mentor, and grow high-performing ... a high-performing engineering team dedicated to developing and scaling AI/ML solutions and LLM agents for the healthcare sector. + Oversee the design and development… more
- Meta (Menlo Park, CA)
- …content and user understanding team, with a focus of Large Language Model ( LLM ). We conduct focused research and engineering to build state-of-the-art LLMs. As a ... key driver of Meta's app growth, we're dedicated use LLM -powered world knowledge to deliver user experiences across Facebook, Instagram, Threads, and more. We are… more
- NVIDIA (Santa Clara, CA)
- …in agentic and reasoning use cases. As the scale and complexity of these LLM systems continues to increase, we are seeking outstanding engineers to join our team ... and help shape the future of LLM inference. Our team is dedicated to pushing the...generative AI, agents, and inference systems into the NVIDIA LLM software stack. + Workload Analysis and Optimization: Conduct… more
- LinkedIn (Mountain View, CA)
- …the company. Our team works on a wide range of cutting-edge ML: LLM fine tuning, text generation, LLM -as-a-judge, prompt engineering, embedding-based retrieval, ... lead a team of applied scientists and engineers to design and deliver scalable LLM and matching solutions that improve the relevance and quality of LinkedIn's Talent… more
- Broadcom (CA)
- …lifecycle, from architecting robust backend services to deploying and monitoring highly-available LLM -powered systems in production. This is a senior position on a ... APIs in Java and Python to support ML and LLM workloads. + Lead the deployment and integration of... workloads. + Lead the deployment and integration of LLM and ML capabilities into our core products, collaborating… more
- Oracle (Sacramento, CA)
- …response times. + Practical experience with the latest technologies in LLM and generative AI, such as parameter-efficient fine-tuning, instruction fine-tuning, and ... and Model Context Protocol (MCP) + Hands-on experience with emerging LLM frameworks and plugins, such as LangChain, LlamaIndex, VectorStores and Retrievers,… more
- ServiceNow, Inc. (Santa Clara, CA)
- …deliver business value-not just promise potential. That means taking cutting-edge LLM capabilities and turning them into resilient, secure, and scalable software. ... you act as the CTO of the build-owning everything from backend services to LLM pipelines and front-end integrations. You partner with customers in the field to… more
- Amazon (Santa Monica, CA)
- …to develop automated, scalable solutions to questions in the Large Language Model ( LLM ) space. Applying a combination of expertise in LLMs, coding and linguistics ... do so, they will be tasked with the creation and development of LLM -assisted editorial tools, automated verification scripts and automated annotations (eg LLM… more