"Alerted.org

Job Title, Industry, Employer
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Advanced Search

Advanced Search

Cancel
Remove
+ Add search criteria
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Related to

  • Product Manager - Inference

    NVIDIA (Santa Clara, CA)



    Apply Now

    Inference is the fastest growing and most competitive area in Generative AI today. It is where AI models impact our daily life, and where ever bit of accuracy and performance matter for quality, safety, and cost. Inference is also constantly evolving, with new acceleration algorithms, usecases, and deployment techniques. As a Product Manager for AI Platform Inference you will be responsible for building the tools, SDKs, and libraries which enables developers' Inference deployments to thrive on NVIDIA GPUs.

     

    As NVIDIA Product Managers, our goal is to enable developers to be successful on the NVIDIA Platform, and push the boundaries of what is possible in AI deployments! As Product Managers, we are the champions inside NVIDIA for developers looking to accelerate their deployments on GPUs. We work directly with developers inside and outside of the company to identify key improvements, create roadmaps, and stay alert on the inference landscape. We also work with NVIDIA leaders to define clear product strategy, and marketing team teams to build go-to-market plans. The Product Management organization at NVIDIA is a small, strong, and impactful group. We focus on enabling deep learning across all GPU use cases and providing great solutions for developers. We are seeking a rare blend of product skills, technical depth, and passion to make NVIDIA great for developers. Does that sounds familiar? If so, we would love to hear from you!

    What you'll be doing:

    + Create products to help developers build better Inference deployments

    + Develop product strategy, roadmaps, and go-to-market plans

    + Collaborate with internal and external developers to build product-based roadmaps for model optimization software

    + Work with leadership to align with and drive company strategy

    What we need to see:

    + Experience with Inference deployment and optimization software (ex. vLLM, SGLang, FlashInfer, TensorRT-LLM, Triton, Dynamo, TorchAO, etc.)

    + Demonstrable knowledge of GenAI or machine learning concepts, particularly around performance optimization, and software development and delivery

    + BS or MS degree in Computer Science, Computer Engineering, or similar experience (or equivalent experience)

    + 5+ years of technical product management, or similar, experience at a technology company

    + Strong communication and interpersonal skills

    Ways to Stand Out from the crowd:

    + Experience leading optimization products for Inference

    + Working on Open Source & Github-first developer products with deep customer interactions

    + Knowledge of GPU architecture, HW/SW co-design, and performance profiling

     

    Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 144,000 USD - 218,500 USD for Level 3, and 168,000 USD - 258,750 USD for Level 4.

     

    You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) .

     

    Applications for this job will be accepted at least until July 29, 2025.

     

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

     


    Apply Now



Recent Searches

  • Principal Technical Program Manager (Texas)
  • neuroscience account manager miami (United States)
[X] Clear History

Recent Jobs

  • Product Manager - Inference
    NVIDIA (Santa Clara, CA)
  • Machine Operator
    Aerotek (New Castle, DE)
  • Lead Software Engineer
    OneMain Financial (Baltimore, MD)
[X] Clear History

Account Login

Cancel
 
Forgot your password?

Not a member? Sign up

Sign Up

Cancel
 

Already have an account? Log in
Forgot your password?

Forgot your password?

Cancel
 
Enter the email associated with your account.

Already have an account? Sign in
Not a member? Sign up

© 2025 Alerted.org