- NVIDIA (Santa Clara, CA)
- …What we need to see: + Deep expertise in data center server architectures, HPC systems , and hardware-software co-design. + Expert knowledge of Linux kernel ... NVIDIA is seeking a Senior Software Engineer to join our...OS, middleware, and applications with focus on AI/ML and HPC workloads. + Perform advanced system debugging, root cause… more
- NVIDIA (Santa Clara, CA)
- …to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads ... be doing: + Design and develop enhancements to the HPC batch scheduler(s). + Work extensively with HPC...and tools + Accomplished in computer architecture and operating systems + Experience analyzing and tuning performance for a… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
- NVIDIA (Santa Clara, CA)
- … AI Observability Engineer to help architect and implement distributed observability systems for AI and HPC clusters. We serve and collaborate directly with ... You will be working with a team of dedicated engineers on systems for data collection, aggregation, enrichment, storage, retrieval, and visualization to… more
- NVIDIA (Santa Clara, CA)
- Do you have expertise in CUDA kernel optimization, C++ systems programming, or compiler infrastructure? Join NVIDIA's nvFuser (https://github.com/NVIDIA/Fuser) team ... of GPUs! We're looking for engineers who excel at parallel programming and systems -level performance work and want to directly impact the future of AI compilation.… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently seeking ... a Senior Software Engineer to lead the development...down cluster downtime towards zero, ensuring that our AI systems remain robust and reliable at all times. What… more
- NVIDIA (Santa Clara, CA)
- …how you can make a lasting impact on the world. We are looking for an outstanding engineer for a Senior Performance Engineer role for at scale AI system ... develop new, leading differentiated solutions. You will interact with HPC , OS, CPU and GPU compute, and systems... HPC , OS, CPU and GPU compute, and systems specialist to architect, develop and bring up large… more
- NVIDIA (Santa Clara, CA)
- …the next wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and ... crew that develops and maintains software for complex heterogeneous computing systems that power disruptive products in High Performance Computing and Deep… more
- NVIDIA (Santa Clara, CA)
- The NVIDIA Enterprise Experience (NVEX) Solutions Engineering team is looking for a senior Computer or Software Engineer who is ready to become an authority in ... the highest level of support for InfiniBand, NVLink, and Spectrum-X network systems that interconnect GPUs and AI compute infrastructure. Candidates must have a… more
- NVIDIA (Santa Clara, CA)
- NVIDIA data center systems , such as DGX and HGX, have become...and HPC software stack. We are hiring Sr . Software Engineer who will help build ... for our DGX Server platforms. Simulations play a significant role in building scalable systems at Speed of Light! You will work with world class engineering teams… more