- Meta (Austin, TX)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more
- NVIDIA (Santa Clara, CA)
- …analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks, and ... and implement GPU compute clusters for deep learning and high- performance computing. What you'll be doing: + Provide leadership...storage systems like Lustre and GPFS for AI / HPC workload. Experience working with deep learning… more
- Mayo Clinic (Rochester, MN)
- …computing ( HPC ) environments and the ability to optimize and maintain complex HPC systems . Works independently on projects and assignments within scope of ... seeking a highly skilled and motivated Tech Spec I HPC Engineer to join the HPC Team....SuperPod deployments, with a strong focus on parallel file systems , Kubernetes, containerization, Slurm schedulers, Nvidia Base Command, DDN… more
- Cisco (San Jose, CA)
- …future of AI infrastructure - we'd love to meet you. **Impact** As **High- performance AI compute engineer** , you will be instrumental in defining and ... Principal Engineer - HPC , AI Infrastructure Apply (https://jobs.cisco.com/jobs/Login?projectId=1445895) + Location:San Jose, California, US + Area of… more
- NVIDIA (Santa Clara, CA)
- …fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems . You will develop performance analysis tools… more
- NVIDIA (Santa Clara, CA)
- …group at NVIDIA has openings for software architects in the field of AI and high- performance networking and system software. We research, develop, and ... and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new… more
- NVIDIA (Santa Clara, CA)
- …Be Doing: + Primary responsibilities will include building and enabling robust AI / HPC infrastructure for customers + Support operational and reliability aspects ... of large-scale AI clusters, focusing on performance at scale,...in working with customers + Expertise with parallel file systems (eg Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects… more
- Amazon (Herndon, VA)
- …computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying… more
- General Dynamics Information Technology (Vicksburg, MS)
- …expectations. Familiarity with commonly used HPC services (ie high performance file systems , modules for installing applications, compilers, MPI, OpenMP, ... career. At GDIT, people are our differentiator. As a HPC Computational Scientist supporting High Performance Computing...expert computational support for users of the supercomputing computing systems within the DoD. The team serves as a… more