- NVIDIA (Santa Clara, CA)
- …of experience crafting and operating large scale compute infrastructure. + Experience with AI / HPC job schedulers and orchestrators, such as Slurm, K8s or LSF. ... Applied experience with AI / HPC workflows that use MPI and NCCL. + Proficient in using Linux including Centos/RHEL and/or Ubuntu Linux distributions. A solid… more
- NVIDIA (Santa Clara, CA)
- …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... years of experience designing and operating large scale compute infrastructure + Experience with AI / HPC advanced job schedulers, such as Slurm, K8s, PBS, RTDA or… more
- NVIDIA (Santa Clara, CA)
- …a lasting impact on the world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for EDA ... experience crafting and operating large scale compute infrastructure, including cluster configuration managements tools such as BCM or Ansible....tools such as BCM or Ansible. + Experience with AI / HPC job schedulers and orchestrators, such as… more
- Texas A&M University System (College Station, TX)
- Job Title Senior HPC Engineer Agency Texas A&M University Department Technology Services - IT Enterprise Operations Proposed Minimum Salary Commensurate Job ... sensitive requiring US Citizenship. Opportunities to Contribute * Manage large-scale HPC cluster operations, including OS upgrades, firmware patching, and… more
- University of Pennsylvania (Philadelphia, PA)
- …Engineer to join the team. PARCC's main cluster (Betty), delivers HPC , data-intensive science and Artificial Intelligence ( AI ) resources to researchers at ... Continuously assess emerging tools and technologies for integration into current and future HPC cluster environments. + Actively mentor and support the training… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations, ... and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly… more
- NVIDIA (TX)
- …Do you want to be part of a team that brings new Artificial Intelligence ( AI ) hardware and software technologies to production in customer data centers? As part of ... What you will be doing: + Working with NVIDIA AI Native, Consumer Internet and Enterprise customers on large...on network design, compute/storage and support bring up of server/network/ cluster deployments. You will need to visit customer data… more
- NVIDIA (Santa Clara, CA)
- Join the NVIDIA Deep Learning Frameworks Infrastructure team as a Senior Systems Engineer focusing on High-Performance AI & Networking Applications, committed to ... equivalent experience. + 8+ years of proven experience in AI / HPC Infrastructure. + Familiarity with AI...NCCL, NIXL, NVSHMEM, UCX. + Experience developing or maintaining cluster management and monitoring tools Ex: ansible for infrastructure… more
- NVIDIA (MA)
- …an experienced systems architect with an interest in advancing artificial intelligence ( AI ) and high-performance computing ( HPC ) in academic and research ... requires a strong background in building and deploying research computing clusters, deploying AI and HPC workloads, and optimizing system performance at scale.… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI . We are currently ... Senior Software Engineer to lead the development of AI software resiliency for the most powerful AI...GPUs. Your expertise will be crucial in driving down cluster downtime towards zero, ensuring that our AI… more
Recent Searches
- Senior Data Engineer Java (United States)
- Java Application Support (United States)
- Sr Python Developer (United States)
- Management Development Program Structured (United States)
Recent Jobs
-
Sr Planning Analyst
- US Tech Solutions (Charlotte, NC)
-
Tech III
- Netsync Network Solutions (Houston, TX)
-
Electronic Technician II (RIS)
- Amentum (Tyndall AFB, FL)
-
Acquisition Specialist (Multiple Levels)
- Noblis (Washington, DC)