• Senior HPC and AI

    NVIDIA (Santa Clara, CA)
    …love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis Engineer to join ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...Deep Learning LLM training focused on collectives communication and networking . You will interact with many types of hardware… more
    NVIDIA (09/03/25)
    - Related Jobs
  • Senior Product Architect, HPC

    NVIDIA (Santa Clara, CA)
    …our team, you'll design and shape the architectures that connect the world's most powerful AI clusters. As an HPC Networking Product Architect at NVIDIA, ... scalability. + Experience working with benchmarking tools and performance analysis for large-scale HPC / AI networking deployments. + Understanding of DPU (or… more
    NVIDIA (10/02/25)
    - Related Jobs
  • Senior AI and ML HPC Cluster…

    NVIDIA (Santa Clara, CA)
    …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking , and storage. +… more
    NVIDIA (10/19/25)
    - Related Jobs
  • Senior AI - HPC Cluster…

    NVIDIA (Santa Clara, CA)
    …doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking , and storage. + ... of experience crafting and operating large scale compute infrastructure. + Experience with AI / HPC job schedulers and orchestrators, such as Slurm, K8s or LSF.… more
    NVIDIA (07/31/25)
    - Related Jobs
  • Sr. Software Development Engineer, HPC /ML…

    Amazon (Cupertino, CA)
    …is important. Experience with embedded systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving ... are seeking an experienced engineer to work on distributed AI /ML systems. This role involves working on collective operations...hard problems, want to work with HPC and ML customers, iterate fast and deliver meaningful… more
    Amazon (08/11/25)
    - Related Jobs
  • Senior Software Architect, AI

    NVIDIA (Santa Clara, CA)
    …group at NVIDIA has openings for software architects in the field of AI and high-performance networking and system software. We research, develop, and ... deploy solutions in networking hardware, programming environments, and system software to make...+ Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI,… more
    NVIDIA (07/31/25)
    - Related Jobs
  • Senior HPC Cluster Engineer - EDA

    NVIDIA (Santa Clara, CA)
    …doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking , and storage. + ... tools such as BCM or Ansible. + Experience with AI / HPC job schedulers and orchestrators, such as...supporting EDA workloads and tools. + Familiarity with High-Speed Networking pertaining to HPC including InfiniBand, RDMA… more
    NVIDIA (09/17/25)
    - Related Jobs
  • Senior Software Architect - Deep Learning…

    NVIDIA (Santa Clara, CA)
    …like NCCL, NVSHMEM, and UCX that are crucial for scaling Deep Learning and HPC . We're seeking a Senior Software Architect to help co-design next-gen data ... (eg NVLink, PCIe) within a node and with high-speed networking (eg InfiniBand, Ethernet) across nodes. Efficient and fast...+ Design and implement new communication technologies to accelerate AI and HPC workloads. + Explore innovative… more
    NVIDIA (07/29/25)
    - Related Jobs
  • Senior Solutions Architect, HPC

    NVIDIA (Santa Clara, CA)
    …ecosystems. You'll be called on to help architect and scale high-performance, distributed AI infrastructure on-prem or in the cloud built with the latest NVIDIA GPU ... PCIe topology, CPUs, GPUs, NICs, Linux OS, and kernel drivers. + Networking experience, including knowledge of Ethernet, InfiniBand or other networking more
    NVIDIA (10/01/25)
    - Related Jobs
  • Sr. Worldwide Specialist Solutions Architect,…

    Amazon (Santa Clara, CA)
    …large analytical problems as massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...- Knowledge of the underlying infrastructure requirements such as Networking , Storage, and Hardware Optimization. - Experience in a… more
    Amazon (09/11/25)
    - Related Jobs