- NVIDIA (Santa Clara, CA)
- …(LLMs) and generative AI workloads. + Enhance performance tuning using TensorRT/TensorRT- LLM , NVIDIA NIM, and Triton Inference Server to improve GPU utilization ... and ensure efficient utilization in Kubernetes environments. + Proficiency with TensorRT- LLM , Triton, and TensorRT for model optimization and serving. + Success… more
- TP-Link North America, Inc. (Irvine, CA)
- …+ Design and maintain vector databases and embedding pipelines to support LLM applications, RAG (Retrieval Augmented Generation) systems, semantic search and agentic ... communication and problem-solving skills. PREFERRED QUALIFICATIONS + Experience with LLM frameworks and libraries (eg LangChain, LlamaIndex) is strongly preferred… more
- Amazon (Sunnyvale, CA)
- …algorithms and techniques to advance the state of Large Language Model ( LLM ) training. You will leverage Amazon's heterogeneous data sources and large-scale ... solutions. You will collaborate closely with the Applied Scientists on LLM reinforcement learning experiments and techniques to build automated training workflows.… more
- NVIDIA (Santa Clara, CA)
- …+ Convert and deploy models using frameworks such as TensorRT and TensorRT- LLM + Understand, analyze, profile, and optimize performance of deep learning workloads ... experience with model optimization and serving frameworks, such as: TensorRT, TensorRT- LLM , vLLM, SGLang. As NVIDIA makes inroads into the Datacenter business,… more
- LinkedIn (Mountain View, CA)
- …low latency high performance applications serving very large & complex models across LLM and Personalization models. As an engineer, you will build compute efficient ... other SRE/SWE Engineers, Project Managers, etc. -Experience building ML applications, LLM serving, GPU serving. -Experience with search systems or similar… more
- Walmart (Sunnyvale, CA)
- …for building, storing, and evolving customer profiles as persistent memory for LLM -based decisioning + Partner with product and UX teams to surface proactive ... assistants, or task-oriented dialog systems + Familiarity with long-context LLM applications, memory architectures, and personalization infrastructure + Knowledge of… more
- NVIDIA (Santa Clara, CA)
- …optimize models for efficient inference using frameworks such as TensorRT, TensorRT- LLM , vLLM, and SGLang. + Understand, analyze, profile, and optimize performance ... Hands-on experience with model optimization and serving frameworks, such as: TensorRT, TensorRT- LLM , vLLM, SGLang, and ONNX. + Direct experience with NVIDIA Cosmos,… more
- Nelnet (Sacramento, CA)
- …and technology goals. . Act as a trusted advisor on AI governance, LLM access, and model risk. . Anticipate future regulatory requirements around AI usage ... similar languages. + Knowledge of AI-specific threat frameworks (eg, MITRE ATLAS, OWASP LLM Top 10). + Strong investigative skills and comfort working with logs,… more
- Google (San Francisco, CA)
- …seamless interactions. This is especially critical as product-market fit for novel LLM -powered experiences on devices is still emerging, requiring keen insight to ... champion a clear product vision, roadmap, and strategic plan for extending LLM capabilities via tools, effectively integrating emerging AI technologies and ensuring… more
- NVIDIA (Santa Clara, CA)
- …training methods. + Understanding of CPU/GPU architecture plus CUDA, cuDNN, TensorRT‑ LLM , Triton, NCCL + Excellent written and verbal communication for technical ... skills that simplify complex technology for diverse audiences. + Familiarity with modern LLM architectures and ability to write Torch code and occasional custom GPU… more