- The Walt Disney Company (Emeryville, CA)
- …of our team, you'll apply your understanding of high-performance computing ( HPC ) storage architectures to administer and help architect our centralized storage ... within our studio. **RESPONSIBILITIES:** + Provide operational support for our on-prem HPC storage systems + Collaborate with vendors to develop and conduct… more
- NVIDIA (Santa Clara, CA)
- …in debugging and performance tuning large-scale AI workloads in cloud and HPC environments, ensuring seamless operation of AI training and inference workloads. What ... computing in AI training. + Experience working with large-scale AI clusters, HPC environments, or cloud-based AI workloads . + Strong systems programming skills… more
- NVIDIA (Santa Clara, CA)
- …Now, GPU deep learning is driving modern AI forward. Join our GPU AI/ HPC Infrastructure team and lead the design of groundbreaking GPU compute clusters for ... demanding AI, HPC , and compute-intensive workloads. We are seeking an engineering leader for the Data Platform team to empower NVIDIA engineering teams with high… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …matter expert for infrastructure initiatives, particularly in High Performance Computing ( HPC ) environments + Drive operational excellence by + Achieving highest ... of 8 years of experience + Proven experience in technical systems ( HPC , networking, storage) + Demonstrated ability to operate and manage large-scale,… more
- NVIDIA (Santa Clara, CA)
- …spanning firmware, OS, middleware, and applications with focus on AI/ML and HPC workloads. + Perform advanced system debugging, root cause analysis, and performance ... we need to see: + Deep expertise in data center server architectures, HPC systems, and hardware-software co-design. + Expert knowledge of Linux kernel internals,… more
- NVIDIA (Santa Clara, CA)
- …help design and deploy cutting-edge NVIDIA networking platforms to run AI and HPC workloads + Address sophisticated and highly visible customer issues + Work closely ... + Familiarity with the Infiniband spec + Experience with distributed processing, HPC , and Message Passing Interface (MPI) + Strong analytical and problem-solving… more
- Deloitte (Sacramento, CA)
- …in the cloud or on prem + Adopt best engineering practices in automation, HPC and AI/GenAI infrastructure and design patterns + Define and lead technology proof of ... Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack Information for applicants with a need for accommodation:… more
- General Atomics (San Diego, CA)
- …focusing primarily on Linux systems, advanced computing environments such as HPC , cyber security, file systems, data backup), concepts, theory, and practice ... and podamn. Linux KVM/QEMU/Libvirt based virtual host provisioning and management. HPC related software like SLURM, InfiniBand, environment modules (LMOD), parallel… more
- NVIDIA (Santa Clara, CA)
- …communication runtimes for Deep Learning frameworks (eg NCCL for TensorFlow/Pytorch) and HPC programming interfaces (eg UCX for MPI/OpenSHMEM) on GPU clusters. + ... of high-performance networks like InfiniBand, iWARP etc. + Experience with HPC applications. + Experience with Deep Learning Frameworks such PyTorch, TensorFlow,… more
- Google (Sunnyvale, CA)
- …as we continue to push technology forward. GCP is favored by AI/ML and HPC customers for its rich heritage of offerings including TPUs, GPUs, GKE for scheduling ... workloads, Vertex AI, integration with GCS object storage, etc. Customers want their Lustre file system to work seamlessly with all these technologies, and deliver the maximum performance. Come help us make Lustre shine on GCP and work cohesively with all… more