• HPC Systems Engineer (Top…

    SpaceX (Hawthorne, CA)
    HPC Systems Engineer (Top...1+ years of professional experience building, deploying and troubleshooting Linux systems . + Experience with a scripting ... (TOP SECRET CLEARANCE) SpaceX is looking for an HPC Systems Engineer with strong...SpaceX employees across engineering disciplines. + Install and integrate Linux -based compute clusters. + Write instructional documentation and convey… more
    SpaceX (04/15/25)
    - Related Jobs
  • Principal or Senior Principal HPC

    Northrop Grumman (Redondo Beach, CA)
    …code deployment, maintenance, and optimization efforts. The lessons learned from existing HPC systems will inform the architecture, deployment, and utilization ... but are not limited to: + Develop and deploy architectures for future HPC systems based on engineering computing requirements, collaborating with engineering to… more
    Northrop Grumman (06/14/25)
    - Related Jobs
  • Senior AI- HPC Cluster Engineer

    NVIDIA (Santa Clara, CA)
    …doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... or LSF + Proficient in administering Centos/RHEL and/or Ubuntu Linux distributions + Solid understanding of cluster configuration managements...IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/ HPC more
    NVIDIA (04/02/25)
    - Related Jobs
  • Senior AI- HPC Storage Engineer

    NVIDIA (Santa Clara, CA)
    …Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... implementation of distributed storage services. + Design, implement an on-prem AI/ HPC infrastructure supplemented with cloud computing to support the growing needs… more
    NVIDIA (05/07/25)
    - Related Jobs
  • Senior HPC Engineer , Infrastructure…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior HPC Engineer to join its Infrastructure Specialists team. Academic, commercial and government groups around the world are using ... the team building many of the largest and fastest AI/ HPC systems in the world! NVIDIA is...+ Primary responsibilities will include deploying, managing, and validating AI/ HPC infrastructure in Linux -based environments for new… more
    NVIDIA (06/12/25)
    - Related Jobs
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    Description We are seeking an experienced engineer to work on distributed AI/ML systems . This role involves working on collective operations - the fundamental ... Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux , kernels, and performant code is important. Experience with embedded systems is… more
    Amazon (05/14/25)
    - Related Jobs
  • Senior Software Engineer - HPC

    NVIDIA (Santa Clara, CA)
    …long term maintenance strategy. What you'll be doing: + Design highly available and scalable systems to meet the demands of our HPC clusters + Evaluate new and ... us today. We are looking for a Senior Software Engineer to join our mission to continue improving our... to join our mission to continue improving our HPC infrastructure. Our team builds and operates sophisticated infrastructure… more
    NVIDIA (05/28/25)
    - Related Jobs
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …efficiency, and performance and drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are ... responsible for the big picture of how our systems relate to each other, we use a breadth...and support workload and resource schedulers in a large-scale HPC environment. + Automate Everything: Develop automation scripts to… more
    NVIDIA (04/04/25)
    - Related Jobs
  • Software Development Engineer , Nitro High…

    Amazon (Sunnyvale, CA)
    …engineers with systems knowledge and experience in area such as Linux OS boot sequencing, Kernel, Hypervisor (Xen or KVM), peripheral device development (PCIe ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...High performance computing workloads. The Nitro High Memory and HPC team owns the purpose built platform development for… more
    Amazon (04/29/25)
    - Related Jobs
  • Senior Solutions Architect, HPC

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer . Do you want to be part of a team that brings new Artificial ... center GPU server and networking system deployments as Solution Architect Engineer . Guide customer discussions on network design, compute/storage and support bring… more
    NVIDIA (06/05/25)
    - Related Jobs