• AI / HPC System

    Meta (New York, NY)
    …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...teamwork and close collaboration 3. Responsible for the overall performance of the communication system , including … more
    Meta (11/06/25)
    - Related Jobs
  • Technical Lead, HPC & AI

    Rensselaer Polytechnic Institute (Troy, NY)
    AI is a senior member of the team responsible for the design and implementation of HPC and AI systems . The Technical Lead also develops and aids in the ... Skills, and Abilities + Experience with design, deployment, and management of HPC systems including storage, file systems , networking, virtualization,… more
    Rensselaer Polytechnic Institute (11/06/25)
    - Related Jobs
  • Senior Software Engineer- AI Hardware

    Bloomberg (New York, NY)
    …and maintenance of our HPC / AI clusters, ensuring peak performance and reliability + Drive system upgrades, customization, and seamless integration ... enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems . This...overseeing the ongoing monitoring, support, and maintenance of our HPC / AI clusters, ensuring peak performance more
    Bloomberg (10/01/25)
    - Related Jobs
  • Technical Program Manager, AI Network Infra

    Meta (New York, NY)
    AI product introductions and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps . They will be responsible ... deliver on shared goals 10. The ideal candidate will have experience in AI / HPC product development and operations, demonstrated experience in the Network… more
    Meta (10/05/25)
    - Related Jobs
  • AI Engineering Manager/Solutions Architect…

    Deloitte (New York, NY)
    …Solutions Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack Information for applicants with a need for ... in the cloud or on prem + Adopt best engineering practices in automation, HPC and AI /GenAI infrastructure and design patterns + Define and lead technology… more
    Deloitte (09/11/25)
    - Related Jobs
  • Machine Learning Engineer I - Multimodal…

    Mount Sinai Health System (New York, NY)
    …:** Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 48,000 employees working across eight ... the development and enhancement of machine learning applications and systems . They will work closely with other engineers and...I to support the lab's core projects in multimodal AI for women's health. The engineer will be responsible… more
    Mount Sinai Health System (10/14/25)
    - Related Jobs
  • Intern: Hybrid Cloud and Quantum Research…

    IBM (Yorktown Heights, NY)
    …Python. Rust, CUDA * Familiarity with executing HPC workloads * Familiarity with HPC system performance evaluation. At IBM, we pride ourselves on being ... technical areas in the context of hybrid cloud, AI systems , networking, security, high-speed networked-storage, accelerators, and HPC principles. The… more
    IBM (10/19/25)
    - Related Jobs
  • Principal Systems Performance

    GE Aerospace (Niskayuna, NY)
    performance metrics + In depth experience in applying system performance improvement for enterprise and cyber-physical systems . + Demonstrated development ... twin concepts and experience with applying analytics, simulation, optimization, AI /ML based software to large complex systems ... performance improvement for enterprise and cyber physical systems at the system and subsystem level… more
    GE Aerospace (10/21/25)
    - Related Jobs
  • Senior Field Services Tech ( Systems

    Huntington Ingalls Industries (Syracuse, NY)
    …closely with IT infrastructure teams, software vendors, and engineering departments to optimize system performance and contribute to the IT Roadmap. * Provide ... critical CAD and PLM software used throughout our shipbuilding projects in a high- performance computing ( HPC ) setting. If you are passionate about learning and… more
    Huntington Ingalls Industries (10/18/25)
    - Related Jobs
  • Software Engineer, SystemML - Scaling…

    Meta (New York, NY)
    …following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high- performance distributed AI /GPU communication stack. Currently, one of the… more
    Meta (11/05/25)
    - Related Jobs