- NVIDIA (Santa Clara, CA)
- …Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads + Familiarity with deep learning frameworks like PyTorch and ... join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute,… more
- NVIDIA (Santa Clara, CA)
- …+ Experience analyzing and tuning performance for a variety of AI/HPC workloads. Excellent problem-solving to analyze complex systems, identify bottlenecks, ... fast, distributed storage systems like Lustre and GPFS for AI/HPC workload. Experience working with deep learning frameworks including PyTorch, MegatronLM… more
- NVIDIA (Santa Clara, CA)
- …workloads. Observability is at the heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to design and build the ... and artificial intelligence. Our technology powers everything from generative AI to autonomous systems, and we continue...and large-scale monitoring systems. + Familiarity with AI/ML pipelines, GPU-based workloads, and HPC… more
- Lilly (Indianapolis, IN)
- …the engineering and operations of advanced Linux platforms supporting AI and HPC workloads, managing NVIDIA DGX systems using Mission Control, Base Command ... the world. Come help us unlock the power of HPC and AI based POGPU and Accelerated...engineering and development of Advanced Linux computing capabilities for AI/ML. Additionally, you would advise with our senior… more
- NVIDIA (Santa Clara, CA)
- …and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new ... and new network hardware features. + Research, design and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM),… more
- NVIDIA (Westford, MA)
- …team and see how you can make a lasting impact on the world. We are seeking a Senior HPC & Quantum Systems Engineer to help architect, deploy, and operate a ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...computation, quantum execution, and data movement across tightly coupled systems. HPC Systems & Operations… more
- NVIDIA (Santa Clara, CA)
- …to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis Engineer to join our ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems. You will develop performance analysis tools and methodologies… more
- NVIDIA (Santa Clara, CA)
- …Be Doing: + Primary responsibilities will include building and enabling robust AI/HPC infrastructure for customers + Support operational and reliability aspects ... be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking...in working with customers + Expertise with parallel file systems (e.g., Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects… more
- Massachusetts Institute of Technology (Cambridge, MA)
- Senior HPC Systems Engineer +...deploying, maintaining, and optimizing HPC clusters, storage systems, and networking for AI/ML workloads. Join a ... SENIOR HPC SYSTEMS ENGINEER, The...and container orchestration tools like Docker and Kubernetes; and experience in cloud-based HPC or AI/ML workloads.… more
- Texas A&M University System (College Station, TX)
- Job Title Senior HPC Engineer Agency Texas A&M...expertise and consultation for the design and deployment of HPC systems. Get in on the ground ... firmware patching, and performance tuning. * Oversee networking, security, and infrastructure for HPC systems. * Lead the development of specialized HPC… more