• Research Data Center Facility Engineer

    Stanford University (Stanford, CA)
    …researchers from a variety of Stanford and SLAC organizations. The majority of the HPC systems are hosted in the Stanford Research Computing Facility (SRCF), ... Research Data Center Facility Engineer **Business Affairs: University IT (UIT), Stanford, California,...Stanford Research Computing. Research Computing offers High Performance Computing ( HPC ) hosting services, computational and data systems ,… more
    Stanford University (07/18/25)
    - Related Jobs
  • Analytics DevOps and Platform Engineer

    UCLA Health (Los Angeles, CA)
    …UCLA Health IT is looking for an outstanding Analytics DevOps and Platform Engineer , (IT Architect), to join the Solutions Architecture and Engineering (SAE) group. ... will possess a well-rounded skillset encompassing software development, knowledge of HPC and Citrix environments, and relevant cloud certifications. We are looking… more
    UCLA Health (05/22/25)
    - Related Jobs
  • Sr Staff Engineer , ML Infrastructure…

    LinkedIn (Mountain View, CA)
    …industry experience. 8+ years of experience designing and managing large-scale, distributed systems or HPC environments, with at least 3+ years focused ... LinkedIn is the world's largest professional network , built to create economic opportunity for every...About the Role We are seeking a Senior Staff Engineer to design, build, and maintain our large-scale GPU… more
    LinkedIn (07/18/25)
    - Related Jobs
  • Senior Software Engineer , GPU…

    NVIDIA (Santa Clara, CA)
    …wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and network ... crew that develops and maintains software for complex heterogeneous computing systems that power disruptive products in High Performance Computing and Deep… more
    NVIDIA (06/12/25)
    - Related Jobs
  • Software Engineer , SystemML - Scaling…

    Meta (Menlo Park, CA)
    **Summary:** In this role, you will be a member of the Network .AI Software team and part of the bigger DC networking organization. The team develops and owns the ... Communications Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and is on… more
    Meta (07/18/25)
    - Related Jobs
  • Software Engineer , SystemML - AI…

    Meta (Menlo Park, CA)
    …Communications Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and is on ... (eg Large-Scale GenAI/LLM training) from the trainer down to the inter-GPU and network communication layer. And we are seeking for engineers to work on the… more
    Meta (04/22/25)
    - Related Jobs
  • Software Engineer - Datacenter networking

    Meta (Menlo Park, CA)
    …Meta's global data center networks. Our work covers the entire network lifecycle, including hardware development, capacity planning, distributed and centralized ... control systems , modeling/provisioning/automation, monitoring/troubleshooting/analytics, and simulation/design/failure analysis.We are actively seeking Software… more
    Meta (07/08/25)
    - Related Jobs
  • High Performance Compute Director

    Microsoft Corporation (Mountain View, CA)
    …or related field AND 6+ years technical experience in software engineering, network engineering, service engineering, systems engineering, or industrial controls ... related field AND 12+ years technical experience in software engineering, network engineering, service engineering, systems engineering, or industrial controls… more
    Microsoft Corporation (07/11/25)
    - Related Jobs
  • High-performance AI compute engineer

    Cisco (San Jose, CA)
    …+ Design, develop, and maintain device drivers and runtime components for GPU and network components of the systems . + Working with kernel and platform ... High-performance AI compute engineer Apply (https://jobs.cisco.com/jobs/Login?projectId=1445895) + Location:San Jose, California, US...by bold ideas and a shared goal: to rethink systems from the ground up and deliver breakthrough solutions… more
    Cisco (07/19/25)
    - Related Jobs
  • Field Application Engineer (Machine…

    quadric.io, Inc (Burlingame, CA)
    …battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems . Unlike other NPUs or neural network accelerators in the ... co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of...C++ DSP and control code. Role: The Field Application Engineer (FAE) will work closely with Business Development, Product,… more
    quadric.io, Inc (06/09/25)
    - Related Jobs