- Cadence Design Systems, Inc. (San Jose, CA)
- …in machine learning, deep learning, and natural language processing (NLP) and LLM. + Technical Skills + Proficiency in Python and experience with frameworks ... of Retrieval-Augmented Generation (RAG) and vector databases. + Comfortable working with LLM APIs and integrating them into applications. + A solid understanding of…
- PennyMac (Westlake Village, CA)
- …AI for the Platform Services division, staying ahead of the curve on LLM capabilities, cost optimization, and model selection (e.g., routing tasks between Claude 3.5 ... Experience with Evaluation Frameworks (e.g., LangSmith, Ragas) for automated testing of LLM outputs. * Familiarity with the Model Context Protocol (MCP) for…
- NVIDIA (Santa Clara, CA)
- …optimize models for efficient inference using frameworks such as TensorRT, TensorRT-LLM, vLLM, and SGLang. + Understand, analyze, profile, and optimize performance ... Hands-on experience with model optimization and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang, and ONNX. + Direct experience with NVIDIA Cosmos,…
- Humana (Sacramento, CA)
- …+ Contribute to and publish novel research on alignment of LLM-based agents, multi-agent cooperation/conflict, or value learning + Proactively identify and ... high performance, large-scale ML systems + Experience with deploying or auditing LLM-based agents or multi-agent AI systems + Experience with large-scale ETL Use…
- VetsEZ (CA)
- …clinical and claims data. + Accelerate Development: Use AI-assisted tools and LLM integrations (e.g., GitHub Copilot, Cursor, OpenAI, Bedrock, Claude) to reduce ... claims processing, or patient engagement systems. + Practical experience operationalizing LLM-based agents or automation pipelines in production. + Background in…
- Insight Global (San Francisco, CA)
- …with deep expertise across Salesforce Core, GovCloud+, Data Cloud, and AI/LLM-powered experiences. You will design, build, and ship production-grade features aligned ... Events) and 2+ years building AI-enabled solutions (prompt engineering, LLM orchestration, retrieval, embeddings, evaluation). - Hands-on experience with Agentforce…
- NVIDIA (Santa Clara, CA)
- …learning and a strong algorithmic background, with exposure to large scale LLM/VLM deployment, inference optimization, and leadership experience, then this role may ... performance analysis and tuning + Exposure to inference platforms such as TensorRT-LLM, vLLM, and SGLang + Project management tools (e.g., JIRA, Microsoft Project)…
- Microsoft Corporation (Mountain View, CA)
- …minds in the field, responsible for developing many of today's most influential LLM models. Notably, MAI has launched MAI-1 models, including MAI-1-image debuting in ... Manager, Foundational Research, you will be advancing the next generation of LLM models working on either (1) pre-training and post-training data and evaluations,…
- Cadence Design Systems, Inc. (San Jose, CA)
- …Kubernetes, and implementing CI/CD pipelines for AI model development. + Advanced LLM Deployment & Optimization: Lead the deployment, serving, and optimization of ... distillation, and using high-performance serving frameworks (e.g., vLLM, TGI, TensorRT-LLM) to maximize inference throughput and minimize latency. + Agentic…
- LinkedIn (Mountain View, CA)
- …low-latency, high-performance applications serving very large & complex models across LLM and Personalization models. As an engineer, you will build compute-efficient ... distributed systems and client-server architectures + Experience building ML applications, LLM serving, GPU serving. + Co-author or maintainer of any open-source…