- Meta (New York, NY)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...teamwork and close collaboration 3. Responsible for the overall performance of the communication system , including … more
- Rensselaer Polytechnic Institute (Troy, NY)
- … AI is a senior member of the team responsible for the design and implementation of HPC and AI systems . The Technical Lead also develops and aids in the ... Skills, and Abilities + Experience with design, deployment, and management of HPC systems including storage, file systems , networking, virtualization,… more
- Bloomberg (New York, NY)
- …and maintenance of our HPC / AI clusters, ensuring peak performance and reliability + Drive system upgrades, customization, and seamless integration ... enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems . This...overseeing the ongoing monitoring, support, and maintenance of our HPC / AI clusters, ensuring peak performance … more
- Meta (New York, NY)
- … AI product introductions and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps . They will be responsible ... deliver on shared goals 10. The ideal candidate will have experience in AI / HPC product development and operations, demonstrated experience in the Network… more
- Deloitte (New York, NY)
- …Solutions Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack Information for applicants with a need for ... in the cloud or on prem + Adopt best engineering practices in automation, HPC and AI /GenAI infrastructure and design patterns + Define and lead technology… more
- Mount Sinai Health System (New York, NY)
- …:** Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 48,000 employees working across eight ... the development and enhancement of machine learning applications and systems . They will work closely with other engineers and...I to support the lab's core projects in multimodal AI for women's health. The engineer will be responsible… more
- IBM (Yorktown Heights, NY)
- …Python. Rust, CUDA * Familiarity with executing HPC workloads * Familiarity with HPC system performance evaluation. At IBM, we pride ourselves on being ... technical areas in the context of hybrid cloud, AI systems , networking, security, high-speed networked-storage, accelerators, and HPC principles. The… more
- GE Aerospace (Niskayuna, NY)
- …performance metrics + In depth experience in applying system performance improvement for enterprise and cyber-physical systems . + Demonstrated development ... twin concepts and experience with applying analytics, simulation, optimization, AI /ML based software to large complex systems ... performance improvement for enterprise and cyber physical systems at the system and subsystem level… more
- Huntington Ingalls Industries (Syracuse, NY)
- …closely with IT infrastructure teams, software vendors, and engineering departments to optimize system performance and contribute to the IT Roadmap. * Provide ... critical CAD and PLM software used throughout our shipbuilding projects in a high- performance computing ( HPC ) setting. If you are passionate about learning and… more
- Meta (New York, NY)
- …following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high- performance distributed AI /GPU communication stack. Currently, one of the… more