- Cisco (San Jose, CA)
- …the next generation of enterprise-grade AI infrastructure. As a principal engineer within our GPU and CUDA Runtime team, you will play a critical role in shaping the ... directly influence the performance, reliability, and scalability of large-scale GPU-accelerated workloads, powering mission-critical applications across AI/ML, scientific computing,…
- Stanford University (Stanford, CA)
- …University has made a strategic investment in Marlowe, a GPU-centric high-performance computing instrument designed to enable large-scale, data-intensive research. ... the scientific process. It also demands the ability to leverage high-performance GPU computing to efficiently process and analyze large datasets. The successful…
- NVIDIA (Santa Clara, CA)
- We are seeking a Lead Post-Silicon Validation Engineer within the GPU Engineering Team to help drive development of future GPUs to be used in 3D graphics, deep learning, ... reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC ... modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next…
- NVIDIA (Santa Clara, CA)
- NVIDIA's groundbreaking invention of the GPU in 1999 not only sparked the growth of the PC gaming market but also redefined modern computer graphics and ... revolutionized parallel computing. More recently, GPU deep learning has ignited the modern AI era, positioning the GPU as the brain behind computers, robots, and self-driving…
- NVIDIA (Santa Clara, CA)
- …solutions for a broad range of AI-based applications. If you're creative, passionate about GPU hardware, and love having fun, please apply today! For two decades, we ... science of computer graphics. With the invention of the GPU - the engine of modern visual computing - ... AI computing era, ignited by a new computing model, GPU deep learning. What you will be doing:…
- NVIDIA (Santa Clara, CA)
- …scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture, ... GPU resource management, and intelligent request handling, Dynamo achieves ... (context ingestion) and decode (token generation) phases across distinct GPU clusters to improve throughput and resource utilization. Contribute…
- NVIDIA (Santa Clara, CA)
- …experienced LLVM Compiler Engineer for an exciting and fun role in our GPU Software organization. We deliver features and improvements to better realize the ... platforms. Our compiler organization makes its mark on every GPU NVIDIA produces. Would you like to add this ... well as for accelerating general purpose computation on the GPU. You will be solving critical problems working alongside…
- NVIDIA (Santa Clara, CA)
- …potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the ... seeking a Resiliency Architect to support the development and validation of GPU (graphics processing unit) hardware and software resiliency features. In this role,…
- NVIDIA (Santa Clara, CA)
- …seeking a Resiliency Architect to support the development and validation of GPU (graphics processing unit) hardware and software resiliency features. In this role, ... metrics like Failures in Time for permanent and transient errors, and Availability from GPU to Rack to Datacenter. Use models to identify gaps and drive RAS…
- NVIDIA (Santa Clara, CA)
- …roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions Architect (Inference Focus), you'll ... you'll be doing: Help customers craft, deploy, and maintain scalable, GPU-accelerated inference pipelines on Kubernetes for large language models (LLMs) and…