  • Senior Software Architect - Deep Learning…

    NVIDIA (Santa Clara, CA)
    …vision? What you will be doing: + Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems. + Design and ... implement new communication technologies to accelerate AI and HPC workloads. + Explore innovative solutions in HW and SW for our next generation platforms as…
    NVIDIA (11/04/25)
  • Distinguished Software Architect - Deep Learning…

    NVIDIA (Santa Clara, CA)
    …is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the ... We deliver communication libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. We are looking for a Distinguished Software Architect to help co-design our…
    NVIDIA (11/21/25)
  • Senior HPC Architect

    NVIDIA (Santa Clara, CA)
    …improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...looking for an outstanding hands-on architect/engineer for a Senior HPC architect role to support deployment and bringup of…
    NVIDIA (01/07/26)
  • Senior GPU and HPC Infrastructure Engineer…

    NVIDIA (Santa Clara, CA)
    …, and excellent communication and planning abilities. Experience working with High Performance Computing (HPC), GPUs, and high-performance networking (RDMA, ... of Linux system administration and management. + Understanding of cluster management systems (Kubernetes, SLURM) + Understanding of performance, security and…
    NVIDIA (01/08/26)
  • Intern 2026: AI Systems Research…

    IBM (San Jose, CA)
    …technical areas in the context of hybrid cloud, AI systems, networking, security, high-speed networked-storage, accelerators, and HPC principles. The ... focuses on the next generation Hybrid Cloud infrastructure for AI, Storage, HPC and Quantum applications. The...Experience with GPU Systems * Familiarity with HPC system performance evaluation. * Familiarity with…
    IBM (11/22/25)
  • Summer Intern - Computational Sciences Center…

    Genentech (South San Francisco, CA)
    …Position 2026 Summer Intern - Computational Sciences Center of Excellence - AI systems performance engineering Department Summary A healthier ... and scientific computing workloads. + Support benchmarking and performance testing efforts for AI systems...HPC or cloud environments. + Experience with distributed systems and parallel computing techniques, including data, model, and…
    Genentech (01/07/26)
  • Research Scientist, AI & Systems

    Meta (Menlo Park, CA)
    …on existing accelerator systems and guiding the future of models and AI HW at Meta. This drives improved performance, new model architectures and ... the following areas: Accelerators/GPU architectures, High Performance Computing (HPC), Machine Learning Compilers, Training/Inference ML Systems, Model…
    Meta (12/20/25)
  • Sr. System Development Engineer, High-…

    Amazon (Cupertino, CA)
    …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI/ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...the limits of performance, efficiency, and scalability in the cloud, this is…
    Amazon (10/25/25)
  • Principal Software Engineer, Networking…

    Oracle (Sacramento, CA)
    …what's possible. Responsibilities + Lead architecture, system design, and implementation for high-performance RDMA solutions across OCI's AI/HPC platforms, ... If you thrive at the intersection of large-scale distributed systems, high-speed networking, and AI workloads, this...performance tuning at scale. + Familiarity with AI/HPC stacks and workloads: NCCL/RCCL/MPI, Slurm or…
    Oracle (12/20/25)
  • Systems Development Eng (AWS Generative…

    Amazon (Cupertino, CA)
    …and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist…
    Amazon (12/10/25)