- NVIDIA (Santa Clara, CA)
- …role involves developing tools for AI researchers and SW/HW teams running AI workload in GPU cluster. As a member of the software development team, we will work with ... debugging tricky failures and issues to help improve the performance and efficiency of the system. What you'll be...Create benchmarking and simulation technologies for AI system or GPU cluster + Partner with HW architects to propose… more
- NVIDIA (Santa Clara, CA)
- …+ Work with various teams at NVIDIA to incorporate and influence the latest technologies for GPU performance analysis What we need to see: + Minimum of 8+ years ... to convert profiling data into actionable optimizations + Support deep learning software engineers and GPU architects in their performance analysis efforts… more
- Amazon (Cupertino, CA)
- …OpenCL, SYCL, or ROCm - Demonstrated experience with NVIDIA PTX and/or AMD GPU ISA - Experience developing high performance libraries for HPC applications ... Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the...GPUs, CPUs, FPGAs, or custom architectures - Experience with GPU kernel optimization and GPGPU computing such as CUDA,… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …for the entire lifecycle of our AI systems, from architecting and building high- performance GPU clusters to deploying and optimizing our most advanced AI ... and manage monitoring solutions for system health, job statuses, GPU utilization, and container performance to proactively...Proven track record as a Principal or Senior Staff Engineer . + Expert-level knowledge of NVIDIA GPU … more
- NVIDIA (Santa Clara, CA)
- …potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the ... applications. NVIDIA is driven to deliver the best possible performance which allows researchers and scientists to do more...and passionate about joining NVIDIA. + Familiarity with NVIDIA GPU products and technologies. + Experience using GPU… more
- Microsoft Corporation (Mountain View, CA)
- **Overview** The Artificial Intelligence Performance team at Microsoft develops AI software that enables running AI models everywhere, from world's fastest AI ... on a collaborative and inclusive culture. We own inference performance of OpenAI and other state of the art...Bing, SQL Server, and Dynamics. As a Senior Software Engineer on the team, you will have the opportunity… more
- NVIDIA (Santa Clara, CA)
- …AI inference frameworks (eg, vLLM, TensorRT-LLM, SGLang). + Experience with GPU resource scheduling, cache management, or high- performance networking. + ... scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture,… more
- NVIDIA (Santa Clara, CA)
- …and reasoning models across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU shards, ... outgrow the memory and compute budget of any single GPU , this platform enables efficient, resilient deployment of cutting-edge...cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and roadmap for memory… more
- General Dynamics Information Technology (Chantilly, VA)
- …**Job Description:** HPC Engineer Who You Are You are a talented, multidiscipline engineer versed in getting the best performance out of systems. You are ... familiar with High Performance Computing using both CPU and GPU based systems. You understand scheduling using SLURM, computing using MPI, and operating software… more
- NVIDIA (Santa Clara, CA)
- …NVIDIA AI and HPC software stack. We are searching for a highly motivated engineer to lead performance benchmarking and optimization efforts for our data center ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...instrumental in ensuring our data center solutions deliver industry-leading performance for accelerated computing workloads. What you will be… more