- Meta (New York, NY)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...teamwork and close collaboration 3. Responsible for the overall performance of the communication system , including … more
- NVIDIA (Santa Clara, CA)
- …fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems . You will develop performance analysis tools… more
- Rensselaer Polytechnic Institute (Troy, NY)
- … AI is a senior member of the team responsible for the design and implementation of HPC and AI systems . The Technical Lead also develops and aids in the ... Skills, and Abilities + Experience with design, deployment, and management of HPC systems including storage, file systems , networking, virtualization,… more
- Lilly (Indianapolis, IN)
- …infrastructure! The Cloud and Connectivity organization is seeking experts and leaders in AI and High- Performance Computing ( HPC ), and Nvidia DGX server ... of advanced Linux platforms supporting AI and HPC workloads, managing Nvidia DGX systems using...expert. **What You Should Bring** + Expertise in Linux system administration, HPC environments, and Nvidia DGX… more
- NVIDIA (Santa Clara, CA)
- …with AI / HPC workflows that use MPI + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Passion for continual learning ... GPU compute clusters that run demanding deep learning, high performance computing, and computationally intensive workloads. We seek a...storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning… more
- NVIDIA (Santa Clara, CA)
- …analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks, and ... and implement GPU compute clusters for deep learning and high- performance computing. What you'll be doing: + Provide leadership...storage systems like Lustre and GPFS for AI / HPC workload. Experience working with deep learning… more
- NVIDIA (Santa Clara, CA)
- …, time-series databases, and large-scale monitoring systems . + Familiarity with AI /ML pipelines, GPU-based workloads , and HPC environments. + Experience ... teams to optimize observability for model training, inference workloads, and HPC performance . + Leverage machine learning and statistical techniques… more
- University of Pennsylvania (Philadelphia, PA)
- …+ Optimize, monitor, and troubleshoot HPC file systems for performance and reliability. + Conduct system benchmarking and develop automated testing to ... facility is seeking a highly qualified and motivated High Performance Computing ( HPC ) Systems Engineer...to join the team. PARCC's main cluster (Betty), delivers HPC , data-intensive science and Artificial Intelligence ( AI )… more
- Amazon (Austin, TX)
- …computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you enjoy tackling large analytical problems as ... - helping them envision and build the future of high- performance computing. Your technical solutions and insights will shape...solutions and insights will shape how partners transform their HPC approaches for the AI era. AWS… more
- Bloomberg (New York, NY)
- …and maintenance of our HPC / AI clusters, ensuring peak performance and reliability + Drive system upgrades, customization, and seamless integration ... enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems . This...overseeing the ongoing monitoring, support, and maintenance of our HPC / AI clusters, ensuring peak performance … more
Recent Searches
- Flex Part Time Security (Virginia)
- Cyber Capability Developer Multiple (Washington, DC)
- Call Data Center Tech (United States)
- Software Integrator Tactical USMC (United States)
Recent Jobs
-
Director, Equity Sales
- CIBC (New York, NY)
-
Surface Maintenance Mechanic
- Army National Guard Units (Guernsey, WY)
-
Food Operations Manager 2
- Sodexo (Orangeburg, SC)
-
Summer Nurse Intern
- Fairview Health Services (St. Paul, MN)