• Senior HPC Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior HPC Engineer to join its Infrastructure Specialists team. Academic, commercial and government groups around the world are ... be doing: + Primary responsibilities will include deploying, managing, and validating AI/ HPC infrastructure in Linux-based environments for new and existing… more
    NVIDIA (06/12/25)
    - Related Jobs
  • Senior HPC Performance…

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of...Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information + Collaborate… more
    NVIDIA (05/05/25)
    - Related Jobs
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    … Software Engineer to join our mission to continue improving our HPC infrastructure . Our team builds and operates sophisticated infrastructure to ... to provide better tools to build and manage this infrastructure . Ideal candidate is strong in software development, designing...and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as… more
    NVIDIA (05/28/25)
    - Related Jobs
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …the choice, join our diverse team today! As a member of the Hardware Infrastructure Farm team, you will provide leadership in the design and implementation of ground ... efficiency, and performance and drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are… more
    NVIDIA (07/03/25)
    - Related Jobs
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving...software components that are critical building blocks for EC2 infrastructure . Every instance in EC2 is running some type… more
    Amazon (07/29/25)
    - Related Jobs
  • Sr. Worldwide Specialist Solutions Architect,…

    Amazon (Santa Clara, CA)
    …large analytical problems as massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... technologies in a multi-user environment. - High level understanding of the underlying infrastructure platform and resources to run HPC services. - Experience… more
    Amazon (06/12/25)
    - Related Jobs
  • Software Development Engineer , Nitro High…

    Amazon (Sunnyvale, CA)
    …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...peripheral device development (PCIe or NVMe) and building compute infrastructure to support High Memory and High performance computing… more
    Amazon (07/29/25)
    - Related Jobs
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …intelligence. Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation ... years of experience designing and operating large scale compute infrastructure + Experience with AI/ HPC advanced job...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
    NVIDIA (07/22/25)
    - Related Jobs
  • Senior Systems Engineer - Autonomous…

    NVIDIA (Santa Clara, CA)
    infrastructure and tools to enable NVIDIA's AV program. We are seeking a motivated Senior Engineer to join our team in building and scaling our cloud-native ... which powers 100s of micro-services and large scale HPC clusters (15k+ GPUs). You'll play a critical role...(15k+ GPUs). You'll play a critical role in driving infrastructure innovation across our organization. Ideal candidates will have… more
    NVIDIA (05/29/25)
    - Related Jobs
  • Senior Research Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in ... 10+ years of full-time industry experience in large-scale MLOps and AI infrastructure ; + Proven experience designing and optimizing distributed training systems with… more
    NVIDIA (06/07/25)
    - Related Jobs