- NVIDIA (Santa Clara, CA)
- …the choice, join our diverse team today! We are looking for an outstanding hands-on architect/ engineer for a Senior HPC architect role to support deployment ... develop new, leading differentiated solutions. You will interact with HPC , OS, GPU compute, and systems specialist...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
- Amazon (Sunnyvale, CA)
- …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...line. The Nitro Team is looking for engineers with systems knowledge and experience in area such as Linux… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a senior HPC software engineer . As a member of our the High Performance Computing Software development team, you will be responsible ... work closely with technical leaders solving some of the biggest challenges in HPC , machine learning, cloud computing, and system co-design. What you'll be doing: The… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **About SLAC:** The SLAC National ... is open to on-site and hybrid work options.** **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing Services… more
- NVIDIA (Santa Clara, CA)
- …What we need to see: + Deep expertise in data center server architectures, HPC systems , and hardware-software co-design. + Expert knowledge of Linux kernel ... NVIDIA is seeking a Senior Software Engineer to join our...OS, middleware, and applications with focus on AI/ML and HPC workloads. + Perform advanced system debugging, root cause… more
- NVIDIA (Santa Clara, CA)
- …to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads ... be doing: + Design and develop enhancements to the HPC batch scheduler(s). + Work extensively with HPC...and tools + Accomplished in computer architecture and operating systems + Experience analyzing and tuning performance for a… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
- NVIDIA (Santa Clara, CA)
- … AI Observability Engineer to help architect and implement distributed observability systems for AI and HPC clusters. We serve and collaborate directly with ... You will be working with a team of dedicated engineers on systems for data collection, aggregation, enrichment, storage, retrieval, and visualization to… more
- NVIDIA (Santa Clara, CA)
- Do you have expertise in CUDA kernel optimization, C++ systems programming, or compiler infrastructure? Join NVIDIA's nvFuser (https://github.com/NVIDIA/Fuser) team ... of GPUs! We're looking for engineers who excel at parallel programming and systems -level performance work and want to directly impact the future of AI compilation.… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently seeking ... a Senior Software Engineer to lead the development...down cluster downtime towards zero, ensuring that our AI systems remain robust and reliable at all times. What… more