- NVIDIA (Santa Clara, CA)
- …(LLMs) and generative AI workloads. + Enhance performance tuning using TensorRT/TensorRT- LLM , NVIDIA NIM, and Triton Inference Server to improve GPU utilization ... and ensure efficient utilization in Kubernetes environments. + Proficiency with TensorRT- LLM , Triton, and TensorRT for model optimization and serving. + Success… more
- Fannie Mae (Reston, VA)
- …serve as a subject matter expert to drive the engineering needs of the LLM platform, including the GenAI Gateway (Portkey), AWS AI services (AWS Bedrock and ... services, enhancing monitoring and telemetry, and supporting application teams consuming LLM capabilities. The ideal candidate will bring deep expertise in both… more
- Coinbase (Boise, ID)
- …automation. * *Deep AI & Automation Expertise:* * Strong understanding of LLM capabilities, limitations, and architectures (eg, RAG, agents, fine-tuning concepts). * ... ability to rapidly prototype and demonstrate working solutions. * Background in AI/ LLM infrastructure is a plus. * *Strong Software Engineering Skills:* * Proven… more
- Capital One (Albany, NY)
- …that whether a prospect asks a search engine or a Large Language Model ( LLM ) about enterprise data solutions, Capital One Software is the cited expert. To achieve ... and knowledge graphs, ensuring content is machine-readable and optimized for LLM retrieval. + "Answer-First" Content Direction: Collaborate with Content and Product… more
- NVIDIA (Santa Clara, CA)
- …support in understanding performance aspects related to tasks like large scale LLM training and inference. + Conducting regular technical customer meetings for ... RAPIDS, etc.). + Familiarity with deep learning architectures and the latest LLM developments. + Background with NVIDIA hardware and software, performance tuning,… more
- NVIDIA (Santa Clara, CA)
- …future needs and share best practices + Work with a diverse set of LLM workloads and their application areas such as health care, climate modeling, pharmaceuticals, ... + Familiarity with popular AI frameworks (PyTorch, TensorFlow, JAX, Megatron-LM, Tensort- LLM , VLLM) among others + Experience with AI/ML models and workloads,… more
- Kyndryl (Dallas, TX)
- …(eg, Redis). Preferred Skills and Experience + Experience with Temporal Workflows, Langfuse, LLM gateways such as LiteLLM + Prior experience in prompt engineering + ... AI-powered chatbots, content generators, or recommendation systems. + Familiarity with LLM fine-tuning, prompt engineering, and AI model optimization. + Knowledge of… more
- Pegasystems (Waltham, MA)
- …prompt and agent design and success across LLMs and versions, researching LLM , prompt engineering, and agent advancements, and collaborating on and aligning design ... + Knowledgeable in prompt design, measurement, and refinement, as well as managing LLM limitations and effectiveness + Experience with RAG (Graph, Agentic, etc.) is… more
- JPMorgan Chase (Wilmington, DE)
- …and knowledge base integrations, utilizing frameworks such as RAGAS for LLM model validation and performance monitoring. + **Work with state-of-the-art LLMs** ... architectures. + Experience with GenAI frameworks and tools, including RAGAS for LLM validation and knowledge base integration. + Hands-on experience with deep… more
- JPMorgan Chase (Washington, DC)
- …capabilities, and skills** + Deep understanding of Large Language Model ( LLM ) techniques, including Agents, Planning, Reasoning, and other related methods. + ... Experience of all LLM models and their capabilities + Experience with building and deploying ML models on cloud platforms such as AWS and AWS tools like Sagemaker,… more