- Microsoft Corporation (Redmond, WA)
- …of millions of customers. We are looking for a **Senior Researcher - LLM Systems** to invent, analyze, and productionize the next generation of serving architectures ... with inference serving frameworks (eg, vLLM, Triton Inference Server, TensorRT- LLM , ONNX Runtime/ORT, Ray Serve, DeepSpeed-MII). + Familiarity with GPU/accelerator… more
- Insight Global (Tampa, FL)
- Job Description We are looking for a Lead or Senior LLM Agent Engineer to join the team. They will be responsible for custom development with open source frameworks, ... of experience - LangGraph - Python and Flask APIs - MongoDB (creating and indexing) - LLM and Open AI models - RAG pipelines (set up and optimization) - Vector DBs -… more
- Oracle (Santa Fe, NM)
- …Healthcare AI systems. **Responsibilities** **Responsibilities** + Build and optimize LLM -powered agents for clinical and operational workflows (summarization, prior ... or related field, or equivalent experience + 2+ years production experience with LLM agents + Proven record improving agent reliability via reward modeling, policy… more
- NVIDIA (Santa Clara, CA)
- …looking for a Senior Research Scientist passionate about Large Language Model ( LLM ) and Diffusion Language Model (DLM) post-training and system optimization. Are you ... andscaling large distributed systems for deep learning. + Contributions to open-source LLM systems or large-scale AI infrastructure. NVIDIA is widely considered to… more
- Red Hat (Raleigh, NC)
- …our team provides a stable platform for enterprises to build, optimize, and scale LLM deployments. You would be joining the core team behind 2025's most popular open ... testing of various inference optimization algorithms in the vLLM LLM -compressor (https://github.com/vllm-project/ llm -compressor) project + Create and manage… more
- Google (Mountain View, CA)
- Software Engineer Manager, Photos Agent/ LLM Infrastructure _corporate_fare_ Google _place_ Mountain View, CA, USA; Los Angeles, CA, USA **Advanced** Experience ... owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain. _info_outline_ X This role may also be located in our Playa Vista, CA campus. Applicants in the County of Los Angeles: Qualified… more
- JPMorgan Chase (Jersey City, NJ)
- …+ Familiarity with engineering systems using large language models + Familiarity with LLM tools such as Langchain or Haystack JPMorganChase, one of the oldest ... financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the JP Morgan and Chase brands. Our history spans over 200… more
- US Tech Solutions (New York, NY)
- …to develop high-quality content for LLMs. + As a content designer working on an LLM , you'll work in an ambiguous, fast-paced, and very exciting area with a lot of ... passionate people. + Conduct reviews and audits of model-generated content to ensure compliance with established standards and identify areas for improvement. + Develop strategies for improving content quality, as well as instructions, guidelines, and… more
- Amazon (Seattle, WA)
- …to lead developing foundational behavioral model for Amazon Stores using Generative AI, LLM and Large Model training techniques. On a day-to-day basis, you will: - ... Research and implement new algorithms and architectures for generative AI applications. - Optimize model performance and scalability for inference and deployment. - Collaborate with other talented applied scientists and engineers to gather and preprocess large… more
- Target (Sunnyvale, CA)
- …mentality that stays on the leading edge of search, query understanding, applied ML, LLM 's and NLP advancements This position will operate as a Hybrid/Flex for Your ... Day work arrangement based on Target's needs. A Hybrid/Flex for Your Day work arrangement means the team member's core role will need to be performed both onsite at the Target HQ Sunnyvale or MN location the role is assigned to and virtually, depending upon… more