  • Solutions Architect, Inference Deployments

    NVIDIA (Santa Clara, CA)
    …(LLMs) and generative AI workloads. + Enhance performance tuning using TensorRT/TensorRT-LLM, NVIDIA NIM, and Triton Inference Server to improve GPU utilization ... and ensure efficient utilization in Kubernetes environments. + Proficiency with TensorRT-LLM, Triton, and TensorRT for model optimization and serving. + Success…
    NVIDIA (08/12/25)
  • Big Data Engineer

    TP-Link North America, Inc. (Irvine, CA)
    …+ Design and maintain vector databases and embedding pipelines to support LLM applications, RAG (Retrieval Augmented Generation) systems, semantic search and agentic ... communication and problem-solving skills. PREFERRED QUALIFICATIONS + Experience with LLM frameworks and libraries (e.g., LangChain, LlamaIndex) is strongly preferred…
    TP-Link North America, Inc. (08/11/25)
  • Sr. Software Engineer (ML), AGI Foundations…

    Amazon (Sunnyvale, CA)
    …algorithms and techniques to advance the state of Large Language Model (LLM) training. You will leverage Amazon's heterogeneous data sources and large-scale ... solutions. You will collaborate closely with the Applied Scientists on LLM reinforcement learning experiments and techniques to build automated training workflows.…
    Amazon (08/08/25)
  • Senior Deep Learning Algorithm Engineer

    NVIDIA (Santa Clara, CA)
    …+ Convert and deploy models using frameworks such as TensorRT and TensorRT-LLM + Understand, analyze, profile, and optimize performance of deep learning workloads ... experience with model optimization and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang. As NVIDIA makes inroads into the Datacenter business,…
    NVIDIA (08/08/25)
  • Staff Software Engineer, AI Platform

    LinkedIn (Mountain View, CA)
    …low latency high performance applications serving very large & complex models across LLM and Personalization models. As an engineer, you will build compute efficient ... other SRE/SWE Engineers, Project Managers, etc. -Experience building ML applications, LLM serving, GPU serving. -Experience with search systems or similar…
    LinkedIn (08/08/25)
  • Director, Data Science - Agent-Led Engagement…

    Walmart (Sunnyvale, CA)
    …for building, storing, and evolving customer profiles as persistent memory for LLM-based decisioning + Partner with product and UX teams to surface proactive ... assistants, or task-oriented dialog systems + Familiarity with long-context LLM applications, memory architectures, and personalization infrastructure + Knowledge of…
    Walmart (08/08/25)
  • Senior DL Algorithms Engineer - Cosmos

    NVIDIA (Santa Clara, CA)
    …optimize models for efficient inference using frameworks such as TensorRT, TensorRT-LLM, vLLM, and SGLang. + Understand, analyze, profile, and optimize performance ... Hands-on experience with model optimization and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang, and ONNX. + Direct experience with NVIDIA Cosmos,…
    NVIDIA (08/08/25)
  • CyberSecurity AI Engineer

    Nelnet (Sacramento, CA)
    …and technology goals. . Act as a trusted advisor on AI governance, LLM access, and model risk. . Anticipate future regulatory requirements around AI usage ... similar languages. + Knowledge of AI-specific threat frameworks (e.g., MITRE ATLAS, OWASP LLM Top 10). + Strong investigative skills and comfort working with logs,…
    Nelnet (08/08/25)
  • Product Manager, Gemini App for Devices

    Google (San Francisco, CA)
    …seamless interactions. This is especially critical as product-market fit for novel LLM-powered experiences on devices is still emerging, requiring keen insight to ... champion a clear product vision, roadmap, and strategic plan for extending LLM capabilities via tools, effectively integrating emerging AI technologies and ensuring…
    Google (08/08/25)
  • Senior System Software Engineer, AI Infrastructure

    NVIDIA (Santa Clara, CA)
    …training methods. + Understanding of CPU/GPU architecture plus CUDA, cuDNN, TensorRT-LLM, Triton, NCCL + Excellent written and verbal communication for technical ... skills that simplify complex technology for diverse audiences. + Familiarity with modern LLM architectures and ability to write Torch code and occasional custom GPU…
    NVIDIA (08/08/25)