• Solutions Architect, Inference Deployments

    NVIDIA (Santa Clara, CA)
    …(LLMs) and generative AI workloads. + Enhance performance tuning using TensorRT/TensorRT- LLM , NVIDIA NIM, and Triton Inference Server to improve GPU utilization ... and ensure efficient utilization in Kubernetes environments. + Proficiency with TensorRT- LLM , Triton, and TensorRT for model optimization and serving. + Success… more
    NVIDIA (11/22/25)
    - Related Jobs
  • Lead GenAI Agent Developer

    Fannie Mae (Reston, VA)
    …serve as a subject matter expert to drive the engineering needs of the LLM platform, including the GenAI Gateway (Portkey), AWS AI services (AWS Bedrock and ... services, enhancing monitoring and telemetry, and supporting application teams consuming LLM capabilities. The ideal candidate will bring deep expertise in both… more
    Fannie Mae (11/22/25)
    - Related Jobs
  • AI Growth Lead (Staff Software Engineer)

    Coinbase (Boise, ID)
    …automation. * *Deep AI & Automation Expertise:* * Strong understanding of LLM capabilities, limitations, and architectures (eg, RAG, agents, fine-tuning concepts). * ... ability to rapidly prototype and demonstrate working solutions. * Background in AI/ LLM infrastructure is a plus. * *Strong Software Engineering Skills:* * Proven… more
    Coinbase (11/21/25)
    - Related Jobs
  • Senior Manager, Digital Content Strategy & Organic…

    Capital One (Albany, NY)
    …that whether a prospect asks a search engine or a Large Language Model ( LLM ) about enterprise data solutions, Capital One Software is the cited expert. To achieve ... and knowledge graphs, ensuring content is machine-readable and optimized for LLM retrieval. + "Answer-First" Content Direction: Collaborate with Content and Product… more
    Capital One (11/21/25)
    - Related Jobs
  • Senior Solutions Architect, GPU - Cloud Service…

    NVIDIA (Santa Clara, CA)
    …support in understanding performance aspects related to tasks like large scale LLM training and inference. + Conducting regular technical customer meetings for ... RAPIDS, etc.). + Familiarity with deep learning architectures and the latest LLM developments. + Background with NVIDIA hardware and software, performance tuning,… more
    NVIDIA (11/21/25)
    - Related Jobs
  • Senior DGX Cloud Performance Engineer

    NVIDIA (Santa Clara, CA)
    …future needs and share best practices + Work with a diverse set of LLM workloads and their application areas such as health care, climate modeling, pharmaceuticals, ... + Familiarity with popular AI frameworks (PyTorch, TensorFlow, JAX, Megatron-LM, Tensort- LLM , VLLM) among others + Experience with AI/ML models and workloads,… more
    NVIDIA (11/21/25)
    - Related Jobs
  • Fullstack Agentic AI Engineer

    Kyndryl (Dallas, TX)
    …(eg, Redis). Preferred Skills and Experience + Experience with Temporal Workflows, Langfuse, LLM gateways such as LiteLLM + Prior experience in prompt engineering + ... AI-powered chatbots, content generators, or recommendation systems. + Familiarity with LLM fine-tuning, prompt engineering, and AI model optimization. + Knowledge of… more
    Kyndryl (11/21/25)
    - Related Jobs
  • Senior Prompt Engineer, Pega Blueprint

    Pegasystems (Waltham, MA)
    …prompt and agent design and success across LLMs and versions, researching LLM , prompt engineering, and agent advancements, and collaborating on and aligning design ... + Knowledgeable in prompt design, measurement, and refinement, as well as managing LLM limitations and effectiveness + Experience with RAG (Graph, Agentic, etc.) is… more
    Pegasystems (11/21/25)
    - Related Jobs
  • Applied AI ML Senior Associate

    JPMorgan Chase (Wilmington, DE)
    …and knowledge base integrations, utilizing frameworks such as RAGAS for LLM model validation and performance monitoring. + **Work with state-of-the-art LLMs** ... architectures. + Experience with GenAI frameworks and tools, including RAGAS for LLM validation and knowledge base integration. + Hands-on experience with deep… more
    JPMorgan Chase (11/21/25)
    - Related Jobs
  • Vice President - Generative Artificial…

    JPMorgan Chase (Washington, DC)
    …capabilities, and skills** + Deep understanding of Large Language Model ( LLM ) techniques, including Agents, Planning, Reasoning, and other related methods. + ... Experience of all LLM models and their capabilities + Experience with building and deploying ML models on cloud platforms such as AWS and AWS tools like Sagemaker,… more
    JPMorgan Chase (11/20/25)
    - Related Jobs