• AI / HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like… more
    Meta (06/18/25)
    - Related Jobs
  • Senior HPC Engineer , Infrastructure…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior HPC Engineer to join its Infrastructure Specialists team. Academic, commercial and government groups around the world are using ... and to power data centers. Join the team building many of the largest and fastest AI / HPC systems in the world! NVIDIA is looking for someone with the ability to… more
    NVIDIA (06/12/25)
    - Related Jobs
  • Senior Solutions Architect, HPC Systems…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer . Do you want to be part of a team that brings new Artificial ... Intelligence ( AI ) hardware and software technologies to production in customer...GPU server and networking system deployments as Solution Architect Engineer . Guide customer discussions on network design,… more
    NVIDIA (06/05/25)
    - Related Jobs
  • Software Engineer , SystemML - AI

    Meta (Menlo Park, CA)
    …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - AI Networking Responsibilities: 1. Tech-leading the ... this role, you will be a member of the AI Networking Software team and part of the bigger...Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and… more
    Meta (04/22/25)
    - Related Jobs
  • Hardware Systems Engineer , AI

    Meta (Menlo Park, CA)
    …in exploring, developing and productizing high-performance software and hardware technologies for AI at datacenter scale.Hardware Systems Engineer in RTP work ... and optimize these systems in production. **Required Skills:** Hardware Systems Engineer , AI Systems Responsibilities: 1. Interface with external vendors… more
    Meta (06/25/25)
    - Related Jobs
  • Sr. Hardware Dev Engineer (AWS Generative…

    Amazon (Cupertino, CA)
    …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. AWS Infrastructure Services owns the design, planning, ... Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
    Amazon (06/24/25)
    - Related Jobs
  • AI Applications Engineer

    quadric.io, Inc (Burlingame, CA)
    …(GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint ... or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only...C++ DSP and control code. Role: The Corporate Applications Engineer is the key bridge between development engineering and… more
    quadric.io, Inc (06/13/25)
    - Related Jobs
  • Production Systems Engineer , Sustaining

    Meta (Menlo Park, CA)
    …hardware requirements and specifications (eg, configuring hardware components, GPU, memory, network for AI / HPC workloads) **Public Compensation:** ... **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP)...Responsibilities: 1. Develop robust, industry leading practices for supporting AI and HPC infrastructure at scale 2.… more
    Meta (06/25/25)
    - Related Jobs
  • R&D Applications Engineer

    Broadcom (San Jose, CA)
    …the latest Broadcom switch platforms and emerging network technologies optimized for AI and HPC workloads. + Contribute to hardware and low-level software ... Broadcom high-speed Ethernet switch solutions, specifically designed to accelerate AI /ML and High-Performance Computing ( HPC ) workloads. Our products… more
    Broadcom (05/21/25)
    - Related Jobs
  • Software Engineer , Accelerator Systems…

    Meta (Menlo Park, CA)
    HPC hardware requirements and specifications (eg, configuring hardware components, GPU, memory, network for AI / HPC workloads). 14. Understanding of the ... Qualifications:** Preferred Qualifications: 11. Full-stack experience and understanding of AI / HPC systems, from HW/infrastructure through the application layer,… more
    Meta (05/01/25)
    - Related Jobs