- NVIDIA (Santa Clara, CA)
- …Learning Frameworks Infrastructure team as a Senior Systems Engineer focusing on High - Performance AI & Networking Applications, committed to ground-breaking AI & ... exposure to AI/ HPC workflows employing MPI and NCCL. + Familiarity with High -Speed Networking pertaining to HPC including InfiniBand, RDMA, RoCE, and Amazon… more
- IBM (San Jose, CA)
- …areas in the context of hybrid cloud, AI systems, networking, security, high -speed networked-storage, accelerators, and HPC principles. The selected candidate ... with executing HPC workloads * Familiarity with HPC system performance evaluation. At IBM, we...HPC : experience running HPC workloads on HPC systems * Quantum Computing : experience running… more
- Amazon (Cupertino, CA)
- …offers a unique opportunity to work at the intersection of machine learning, high - performance computing , and distributed architectures, where you'll help ... work on cutting-edge products at the intersection of machine-learning, high - performance computing , and distributed architectures....NVIDIA PTX and/or AMD GPU ISA - Experience developing high performance libraries for HPC … more
- IBM (San Jose, CA)
- …areas in the context of hybrid cloud, AI systems, networking, security, high -speed networked-storage, accelerators, and HPC principles. The selected candidate ... you'll join a team who invent what's next in computing , always choosing the big, urgent and mind-bending work...design * Experience with GPU Systems * Familiarity with HPC system performance evaluation. * Familiarity with… more
- Amazon (Cupertino, CA)
- …and operate next-generation infrastructure that powers breakthrough innovation in AI/ML and HPC workloads. If you're passionate about pushing the limits of ... performance , efficiency, and scalability in the cloud, this is...through server conception, design, test, launch, and operations. Driving high quality and reliability into future/new designs for AWS… more
- Genentech (South San Francisco, CA)
- …We're seeking a PhD/Master's student with expertise and passion for performance -aware scientific computing , particularly in machine learning systems. In ... performance -aware algorithms that scale on multi-node clusters. + Develop and optimize high - performance GPU kernels, and make trade offs to maximize hardware… more
- NVIDIA (Santa Clara, CA)
- …to the design and development of libraries and tools to simplify and accelerate computing for unstructured sparsity in DL and HPC . Around the world, leading ... engineering simulations, using data centers powered by GPUs and high - performance linear algebra libraries. Applications of these...and develop a C++-based system to simplify and accelerate computing for unstructured sparsity in DL and HPC… more
- Meta (Menlo Park, CA)
- …learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing , performance optimizations, or ... large-scale GPU training and inference fleet through an observable, reliable and high - performance distributed AI/GPU communication stack. Currently, one of the… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, ... supporting HPC or AI + Practical experience with high performance networking: Infiniband/RoCE/Ethernet networks, RDMA, topologies, congestion control… more
- NVIDIA (Santa Clara, CA)
- …or equivalent experience. + Prior systems software or communication runtime or high performance networking software development experience with a successful ... libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC . DL and HPC applications have a...CUDA, MPI, OpenMP, OpenACC, pthreads. + Background with RDMA, high - performance networking technologies (InfiniBand, RoCE, Ethernet, EFA),… more