• Staff Software Engineer, Parallel File System,…

    Google (Sunnyvale, CA)
    …coding in C++. + 3 years of experience with distributed or parallel file systems or storage systems . Preferred qualifications: + Master's degree or PhD in ... field. + 5 years of experience with developing infrastructure, distributed systems , or networks, or with compute technologies, storage, or hardware architecture.… more
    Google (05/04/25)
    - Related Jobs
  • Software Engineer, Accelerator Systems

    Meta (Menlo Park, CA)
    …11. Full-stack experience and understanding of AI / HPC systems , from HW/infrastructure through the application layer, performance optimizations, including ... learning domains: hardware accelerators, AI Infrastructure, and/or high performance computing ( HPC ), particularly pertaining to interconnect and collective.… more
    Meta (05/01/25)
    - Related Jobs
  • System Engineer - Interconnect

    Meta (Menlo Park, CA)
    …custom AI hardware 2. Collect requirements and develop specifications for Rackscale AI / HPC systems . 3. Develop and maintain code, to collect, analyze, ... Meta is developing one of the world's highest performant AI / HPC clusters using custom-designed AI ...cross-functional engineering environment. They will solve complex problems in high- performance AI that span across silicon, hardware,… more
    Meta (04/19/25)
    - Related Jobs
  • Software Engineer, Accelerator Solutions…

    Meta (Menlo Park, CA)
    …**Preferred Qualifications:** Preferred Qualifications: 15. Full-stack experience and understanding of AI / HPC systems , from hardware and infrastructure ... ML domains: hardware accelerators, AI Infrastructure, and/or high performance compute ( HPC ), particularly pertaining to interconnect and collective.… more
    Meta (05/01/25)
    - Related Jobs
  • Senior Performance Engineer

    NVIDIA (Santa Clara, CA)
    …impact on the world. We are looking for an outstanding engineer for a Senior Performance Engineer role for at scale AI system performance and datacenter ... develop new, leading differentiated solutions. You will interact with HPC , OS, CPU and GPU compute, and systems...+ Deliver engineering solutions to deliver continuous insights into performance of AI workloads over evolving environments,… more
    NVIDIA (04/30/25)
    - Related Jobs
  • Sr. Software Development Engineer, ML…

    Amazon (Cupertino, CA)
    …are used to guarantee top performance of AWS ML and High Performance Computing ( HPC ) technologies developed by our organization. Bring your exceptional ... knowledge of CI/CD automation, ML and HPC benchmarks and applications to bear on the cutting-edge...Join us as we expand the AWS offerings for AI , including Trainium, Graviton and the Elastic Fabric Adapter… more
    Amazon (02/15/25)
    - Related Jobs
  • Sr Staff Engineer, ML Infrastructure…

    LinkedIn (Mountain View, CA)
    …parallel file systems , object storage, NVMe over Fabric) to meet performance and capacity requirements for ML workloads. Collaborate with network and storage ... our large-scale GPU infrastructure for machine learning (ML) and AI workloads. In this role, you will be the...8+ years of experience designing and managing large-scale, distributed systems or HPC environments, with at least… more
    LinkedIn (04/18/25)
    - Related Jobs
  • Senior System Software Engineer, NCCL - Partner…

    NVIDIA (Santa Clara, CA)
    …with engineering or academic research community supporting HPC or AI + Practical experience with high performance networking: Infiniband/RoCE/Ethernet ... to get an end to end understanding of the AI networking stack. Are you ready for to contribute...to stand out from the crowd: + Experience conducting performance benchmarking and developing infrastructure on HPC more
    NVIDIA (04/22/25)
    - Related Jobs
  • GPU Compiler Performance Engineer

    Qualcomm (Santa Clara, CA)
    …characterize trending GPU benchmarks and applications (games, HPC , AR/VR and AI ) + Use/develop tools to identify performance bottlenecks and study ... of parallel computing on multi-core CPU, GPU, or heterogeneous systems + Extensive experience with benchmarking and performance...with performance profiling and modeling for games, HPC , AR/VR, or AI applications + Experience… more
    Qualcomm (04/16/25)
    - Related Jobs
  • Software Engineer, Accelerator Solutions…

    Meta (Menlo Park, CA)
    …training **Preferred Qualifications:** Preferred Qualifications: 15. Full-stack experience and understanding of AI / HPC systems , with a focus on the ... of Meta's accelerators collective communications software library and optimizing distributed AI /ML workloads' performance . This is an opportunity to work… more
    Meta (05/03/25)
    - Related Jobs