• Senior Site Reliability Engineer - AI Research…

    NVIDIA (Santa Clara, CA)
    …to enhance researcher productivity. + Tackle strategic challenges in large-scale, high - performance computing environments. + Troubleshoot, diagnose and ... scale compute infrastructure + Proven experience in site reliability engineering for high - performance computing environments with operational experience of… more
    NVIDIA (06/25/25)
    - Related Jobs
  • Sr. Worldwide Specialist Solutions Architect, HPC

    Amazon (Santa Clara, CA)
    …cloud computing and its potential to overcome some of the biggest challenges in High Performance Computing (HPC)? Do you have a unique combination of ... knowledge of HPC schedulers and distributed/parallel file systems , underlying IT systems , and the HPC development process, high throughput and tight coupling… more
    Amazon (06/12/25)
    - Related Jobs
  • Software Development Engineer, EC2 Instance…

    Amazon (Sunnyvale, CA)
    …networking, kernel development, and distributed systems - Understanding of high - performance computing clusters and parallel programming Preferred ... testing frameworks and stress testing tools for multi-rack distributed systems * Debug complex system -level issues across...- Strong programming skills in C/C++ with focus on high - performance systems - Experience with… more
    Amazon (06/10/25)
    - Related Jobs
  • Software Development Engineer, Nitro High

    Amazon (Sunnyvale, CA)
    …(PCIe or NVMe) and building compute infrastructure to support High Memory and High performance computing workloads. The Nitro High Memory and ... line. The Nitro Team is looking for engineers with systems knowledge and experience in area such as Linux...team owns the purpose built platform development for the High performance computing workloads and… more
    Amazon (04/29/25)
    - Related Jobs
  • Senior GPU Supercomputer Scheduler Engineer

    NVIDIA (Santa Clara, CA)
    …implementation of groundbreaking GPU compute clusters that run demanding deep learning, high performance computing , and computationally intensive workloads. ... some of the biggest challenges in machine learning, cloud computing , and system co-design. What you'll be...to stand out from the crowd: + Knowledge in High - performance computing + Open Source… more
    NVIDIA (05/21/25)
    - Related Jobs
  • Architect, Data Center Modeling

    NVIDIA (Santa Clara, CA)
    …NVIDIA , we are redefining industries with our groundbreaking advancements in High - Performance Computing , Artificial Intelligence, and Visualization. Our ... to a business-critical codebase, directly impacting the future of high - performance computing and AI technologies....experience (MS preferred) + 5+ years of experience in systems architecture and modeling, especially performance , power,… more
    NVIDIA (05/24/25)
    - Related Jobs
  • Senior HSIO Validation Engineer

    NVIDIA (Santa Clara, CA)
    …platforms for autonomous machines, Cloud and Data Centers, Deep learning, High - Performance Computing , Gaming, and Entertainment solutions. What ... NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High - Performance Computing , and Visualization. Our invention, the GPU,… more
    NVIDIA (05/29/25)
    - Related Jobs
  • Senior Architect, GPU and SoC Modelling

    NVIDIA (Santa Clara, CA)
    computing fields, delivering the highest performance in the world for high - performance computing . We are constantly looking for ways to improve our ... and features that advance the state of art in performance and efficiency. What you'll be doing: + Modeling...C++, C along with a good understanding of build systems (CMAKE, make), toolchains (GCC, MSVC) and libraries (STL,… more
    NVIDIA (05/07/25)
    - Related Jobs
  • Datacenter Resiliency Architect - New College Grad

    NVIDIA (Santa Clara, CA)
    …SOCs powering product lines for the growing field of artificial intelligence (AI) and high - performance computing (HPC). What you'll be doing: + Architect ... potential of AI to define the next era of computing . An era in which our GPU acts as...hardware and software Resiliency features to improve system Reliability, Availability, Serviceability (RAS), and performance more
    NVIDIA (05/20/25)
    - Related Jobs
  • Applied Scientist, Console Science

    Amazon (Santa Clara, CA)
    …data structures, parsing, numerical optimization, data mining, parallel and distributed computing , high - performance computing Preferred Qualifications ... a strong machine learning background to help build industry-leading Conversational AI Systems . Our mission is to provide a delightful experience to Amazon's… more
    Amazon (04/15/25)
    - Related Jobs