• Senior HPC Performance Engineer

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are…
    NVIDIA (05/05/25)
  • Senior HPC Engineer, Infrastructure…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior HPC Engineer to join its Infrastructure Specialists team. Academic, commercial and government groups around the world are using ... and to power data centers. Join the team building many of the largest and fastest AI/HPC systems in the world! NVIDIA is looking for someone with the ability to…
    NVIDIA (06/12/25)
  • AI/HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Active ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like ... a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: …
    Meta (06/18/25)
  • Network Development Engineer I,…

    Amazon (Cupertino, CA)
    Description Do you like to use network and Unix systems engineering to deliver simple, sustainable, and repeatable solutions? Would you like to play a key role ... to own them to completion. The Core Networking team is looking for a Network Development Engineer to join our Network Fabric Engineering (NFE) team. As a …
    Amazon (07/08/25)
  • AI Infrastructure Engineer - HPC

    Cisco (San Jose, CA)
    AI Infrastructure Engineer - HPC Apply (https://jobs.cisco.com/jobs/Login?projectId=1443781) + Location: San Jose, California, US + Alternate Location Anywhere is ... of the Internet to Showcase the power of Cisco: our people, products, processes, systems, and data. Please join us and make this journey together! **Your Impact**…
    Cisco (07/15/25)
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are the ... and life cycle of servers in production. **Required Skills:** Production Systems Engineer, Sustaining Responsibilities: 1. Develop robust, industry leading…
    Meta (06/25/25)
  • Software Engineer, Accelerator…

    Meta (Menlo Park, CA)
    …for AI/HPC workloads). 13. Full-stack experience and understanding of AI/HPC systems, from HW/infrastructure through the application layer, performance ... in some of the world's largest scale clusters. **Required Skills:** Software Engineer, Accelerator Systems & Technologies Responsibilities: 1. Understand and…
    Meta (05/01/25)
  • Senior Systems Engineer

    Microsoft Corporation (Mountain View, CA)
    …(SDCS) team, embedded within the broader silicon engineering organization. As a Senior Systems Engineer, you will play a critical role in designing, deploying, ... of a globally distributed design organization. If you are passionate about Linux systems at scale, HPC infrastructure, and enabling cutting-edge silicon design,…
    Microsoft Corporation (07/18/25)
  • Senior System Software Engineer, NCCL…

    NVIDIA (Santa Clara, CA)
    …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking ... Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,…
    NVIDIA (07/07/25)
  • Hardware Systems Engineer, AI…

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking a Hardware Systems Engineer to join our Release to Production (RTP) team working on new NPI hardware. Our servers and data centers ... productizing high-performance software and hardware technologies for AI at datacenter scale. Hardware Systems Engineer in RTP work closely with HW/SW co-design…
    Meta (06/25/25)