- Rensselaer Polytechnic Institute (Troy, NY)
- … AI is a senior member of the team responsible for the design and implementation of HPC and AI systems . The Technical Lead also develops and aids in the ... Skills, and Abilities + Experience with design, deployment, and management of HPC systems including storage, file systems , networking, virtualization,… more
- Meta (New York, NY)
- …and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look… more
- Bloomberg (New York, NY)
- …maintaining system software that enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems . This role will also be responsible ... overseeing the ongoing monitoring, support, and maintenance of our HPC / AI clusters, ensuring peak performance ...enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems +… more
- IBM (Yorktown Heights, NY)
- …technical areas in the context of hybrid cloud, AI systems , networking, security, high-speed networked-storage, accelerators, and HPC principles. The ... focuses on the next generation Hybrid Cloud infrastructure for AI , Storages, HPC and Quantum applications. The...experience with Git * HPC : experience running HPC workloads on HPC systems … more
- GE Aerospace (Niskayuna, NY)
- …finding a better way to climb higher together. We were meant to fly. As Systems Performance Modeling Engineer, within Digital and Electrical Systems (DES)- ... designing, testing and validating statistical and mathematical methods to ensure target performance of enterprise and cyber physical systems within GE… more
- Meta (New York, NY)
- …following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high- performance distributed AI /GPU communication stack. Currently, one of the… more
- Legrand NA (Farmingdale, NY)
- …the development and execution of programs to promote Legrand USystems' cooling solutions for AI and HPC data centers across North America. This role involves ... cooling systems , air cooling technologies, and thermal management products tailored for AI and HPC data centers. + Partner with cross-functional teams to… more
- Mount Sinai Health System (New York, NY)
- …for contributing to the development and enhancement of machine learning applications and systems . They will work closely with other engineers and data scientists to ... design and implement scalable and efficient machine learning systems . We are recruiting a Machine Learning Engineer I...I to support the lab's core projects in multimodal AI for women's health. The engineer will be responsible… more
- NVIDIA (NY)
- …top minds with Financial Services Capital Markets and Exchange firms to accelerate High- Performance Computing and AI workloads across various use cases. We're ... and optimization of machine learning/deep learning models to ensure the best performance on current- and next-generation GPU architectures. + Work directly with… more
- Memorial Sloan-Kettering Cancer Center (New York, NY)
- …AI development with healthcare delivery. + Drive the adoption of scalable AI solutions that meet clinical performance and reliability standards. **Key ... at MSK combine advanced statistical methods, deep learning, and high- performance computing to extract insights from complex datasets-particularly in medical… more