- NVIDIA (Santa Clara, CA)
- …love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis Engineer to join ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...Deep Learning LLM training focused on collectives communication and networking . You will interact with many types of hardware… more
- NVIDIA (Santa Clara, CA)
- …our team, you'll design and shape the architectures that connect the world's most powerful AI clusters. As an HPC Networking Product Architect at NVIDIA, ... scalability. + Experience working with benchmarking tools and performance analysis for large-scale HPC / AI networking deployments. + Understanding of DPU (or… more
- NVIDIA (Santa Clara, CA)
- …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking , and storage. +… more
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking , and storage. + ... of experience crafting and operating large scale compute infrastructure. + Experience with AI / HPC job schedulers and orchestrators, such as Slurm, K8s or LSF.… more
- Amazon (Cupertino, CA)
- …is important. Experience with embedded systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving ... are seeking an experienced engineer to work on distributed AI /ML systems. This role involves working on collective operations...hard problems, want to work with HPC and ML customers, iterate fast and deliver meaningful… more
- NVIDIA (Santa Clara, CA)
- …group at NVIDIA has openings for software architects in the field of AI and high-performance networking and system software. We research, develop, and ... deploy solutions in networking hardware, programming environments, and system software to make...+ Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI,… more
- Texas A&M University System (College Station, TX)
- Job Title Senior HPC Engineer Agency Texas A&M University Department Technology Services - IT Enterprise Operations Proposed Minimum Salary Commensurate Job ... faculty and staff providing cutting-edge research and super computing needs. As a Senior High Performance Computing Engineer ( HPC ), you will provide technical… more
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking , and storage. + ... tools such as BCM or Ansible. + Experience with AI / HPC job schedulers and orchestrators, such as...supporting EDA workloads and tools. + Familiarity with High-Speed Networking pertaining to HPC including InfiniBand, RDMA… more
- NVIDIA (Santa Clara, CA)
- …like NCCL, NVSHMEM, and UCX that are crucial for scaling Deep Learning and HPC . We're seeking a Senior Software Architect to help co-design next-gen data ... (eg NVLink, PCIe) within a node and with high-speed networking (eg InfiniBand, Ethernet) across nodes. Efficient and fast...+ Design and implement new communication technologies to accelerate AI and HPC workloads. + Explore innovative… more
- University of Pennsylvania (Philadelphia, PA)
- …with the university's central networking group (ISC). + Apply HPC networking configurations and security protocols to optimize resource utilization ... Job Title HPC Systems Engineer Job Profile Title Systems Administrator Senior Job Description Summary The Penn Advanced Research Computing Center (PARCC) core… more
Recent Jobs
-
Commercial Account Specialist
- UMB Bank (Kansas City, MO)
-
Senior Adobe Commerce Developer
- CGI Technologies and Solutions, Inc. (Alexandria, VA)
-
Sr Compliance Coding Analyst
- Rush University Medical Center (Chicago, IL)
-
Rental Account Manager - Cold Storage
- Trane Technologies (Rocklin, CA)