- Oracle (Sacramento, CA)
- …, supporting millions of devices, multi-region interconnects, and high-performance compute (HPC/AI/GPU) environments. + Integrate ML and LLM-based models into OCI's ... predictive analytics for infrastructure operations. + Hands-on understanding of HPC, GPU networking, and high-bandwidth interconnect architectures. + Proficiency in…
- NVIDIA (Santa Clara, CA)
- …technical and interpersonal skills to analyze, define, implement and optimize AI/ML and HPC software and system solutions at hyper scale. What you'll be doing: ... Passion for enhancing customer experience + Proficiency in AI, ML and HPC applications + Comprehensive knowledge of computer system architecture including PCIe,…
- NVIDIA (Santa Clara, CA)
- …Artificial Intelligence (AI), Deep Learning (DL), autonomous vehicles, and High-Performance Computing (HPC)? NVIDIA is seeking a skilled CPU Power Architect to own ... a global leader in accelerated computing, delivering breakthroughs in AI, HPC, and advanced system design. Our technologies power transformative applications across…
- Meta (Menlo Park, CA)
- …end-to-end AI product introductions and AI operations initiatives supporting Meta's growing AI/HPC infrastructure for our Family of Apps. They will be responsible ... on shared goals 10. The ideal candidate will have experience in AI/HPC product development and operations, demonstrated experience in the Network communications…
- Amazon (Cupertino, CA)
- …full software development experience - Expertise in accelerator architectures for ML or HPC such as GPUs, CPUs, FPGAs, or custom architectures - Experience with GPU ... and/or AMD GPU ISA - Experience developing high performance libraries for HPC applications - Proficiency in low-level performance optimization for GPUs - Experience…
- NVIDIA (Santa Clara, CA)
- …NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technical architect to own the ... in standards bodies such as OCP and DMTF. + Familiarity with NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA) + Knowledge of enterprise storage…
- UIC Government Services and the Bowhead Family of Companies (San Diego, CA)
- …for building/maintaining automated build materials for each CREATE Product for all supported HPC platforms. These include all HPCMP DSRC systems, as well as other ... HPC clusters approved through CREATE Configuration Control Board (CCB) process. **Qualifications** * BA/S in Computer Science, Information Systems, Engineering,…
- Meta (Menlo Park, CA)
- …to algorithms, tooling, and interfaces, working across multiple accelerator types and HPC collective communication libraries such as NCCL, RCCL, UCC and MPI. 7. ... of the following areas: Accelerators/GPU architectures, High Performance Computing (HPC), Machine Learning Compilers, Training/Inference ML Systems, Model Compression,…
- Meta (Menlo Park, CA)
- …Communications Library), which enables multi-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and is on ... Fully Sharded Data Parallel (FSDP), Tensor Parallel, and Pipeline Parallel 9. Experience in HPC and parallel computing 10. Knowledge of ML, deep learning and LLM 11.…
- NVIDIA (Santa Clara, CA)
- …software management ecosystem. We are focused on supporting NVIDIA products across HPC, cloud, and enterprise on both bare metal and virtualized platforms as ... Experience developing Kubernetes operators or Helm charts + Experience with HPC job schedulers like Slurm or Run.AI Familiarity with Kubernetes internals.…