- NVIDIA (Durham, NC)
- We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer ... who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreaking hardware-software… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Architect for LLM Inference! NVIDIA is at the forefront of the generative AI revolution. Our Inference ... 6+ years of relevant industry experience + Detailed knowledge of deep learning inference serving, PyTorch programming, profiling, and compiler optimizations.… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, ... at the forefront of efficient large-scale model serving and inference. You will play a central role in improving...of groundbreaking language models. You'll work closely with the deep learning community to implement the latest… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, ... Our team is responsible for developing and maintaining high-performance deep learning frameworks, including SGLang and vLLM,...at the forefront of efficient large-scale model serving and inference. You will play a central role in improving… more
- NVIDIA (Santa Clara, CA)
- Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA's TensorRT team as a Senior ... best practices with C++11 and C++14. + Familiarity with deep learning concepts and frameworks. + A...models (such as Large Language Models) & frameworks for inference. + Background with C++17. NVIDIA is widely considered… more
- Amazon (Cupertino, CA)
- …Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference… more
- Amazon (Seattle, WA)
- …Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference… more
- NVIDIA (CA)
- …Dynamo Inference Server! NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team, and we are a remote friendly work ... on the world. We are now looking for a Senior System Software Engineer to work on user facing...world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from… more
- NVIDIA (Santa Clara, CA)
- …and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative ... cache optimization, parallelism strategies). + Drive continuous innovation in deep learning inference performance to strengthen NVIDIA platform integration… more
- NVIDIA (Santa Clara, CA)
- …Deep understanding of modern data center architectures, accelerated computing, distributed inference, deep learning frameworks (PyTorch, TensorFlow, JAX), ... We are looking for a Senior Technical Product Marketing Manager. This role will...rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with… more