• Storage Engineer

    The Walt Disney Company (Emeryville, CA)
    …of our team, you'll apply your understanding of high-performance computing ( HPC ) storage architectures to administer and help architect our centralized storage ... within our studio. **RESPONSIBILITIES:** + Provide operational support for our on-prem HPC storage systems + Collaborate with vendors to develop and conduct… more
    The Walt Disney Company (07/25/25)
    - Related Jobs
  • Senior Software Engineer, AI Resiliency

    NVIDIA (Santa Clara, CA)
    …in debugging and performance tuning large-scale AI workloads in cloud and HPC environments, ensuring seamless operation of AI training and inference workloads. What ... computing in AI training. + Experience working with large-scale AI clusters, HPC environments, or cloud-based AI workloads . + Strong systems programming skills… more
    NVIDIA (07/22/25)
    - Related Jobs
  • Senior Manager, Software Engineering - Data…

    NVIDIA (Santa Clara, CA)
    …Now, GPU deep learning is driving modern AI forward. Join our GPU AI/ HPC Infrastructure team and lead the design of groundbreaking GPU compute clusters for ... demanding AI, HPC , and compute-intensive workloads. We are seeking an engineering leader for the Data Platform team to empower NVIDIA engineering teams with high… more
    NVIDIA (07/12/25)
    - Related Jobs
  • Director of Cloud and Datacenter Operations

    Cadence Design Systems, Inc. (San Jose, CA)
    …matter expert for infrastructure initiatives, particularly in High Performance Computing ( HPC ) environments + Drive operational excellence by + Achieving highest ... of 8 years of experience + Proven experience in technical systems ( HPC , networking, storage) + Demonstrated ability to operate and manage large-scale,… more
    Cadence Design Systems, Inc. (07/10/25)
    - Related Jobs
  • Senior Linux Kernel Systems Software Engineer…

    NVIDIA (Santa Clara, CA)
    …spanning firmware, OS, middleware, and applications with focus on AI/ML and HPC workloads. + Perform advanced system debugging, root cause analysis, and performance ... we need to see: + Deep expertise in data center server architectures, HPC systems, and hardware-software co-design. + Expert knowledge of Linux kernel internals,… more
    NVIDIA (07/02/25)
    - Related Jobs
  • Senior Networking Application Engineer - NVLINK…

    NVIDIA (Santa Clara, CA)
    …help design and deploy cutting-edge NVIDIA networking platforms to run AI and HPC workloads + Address sophisticated and highly visible customer issues + Work closely ... + Familiarity with the Infiniband spec + Experience with distributed processing, HPC , and Message Passing Interface (MPI) + Strong analytical and problem-solving… more
    NVIDIA (06/17/25)
    - Related Jobs
  • AI Engineering Manager/Solutions Architect - SFL…

    Deloitte (Sacramento, CA)
    …in the cloud or on prem + Adopt best engineering practices in automation, HPC and AI/GenAI infrastructure and design patterns + Define and lead technology proof of ... Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack Information for applicants with a need for accommodation:… more
    Deloitte (06/12/25)
    - Related Jobs
  • Computer Systems Manager

    General Atomics (San Diego, CA)
    …focusing primarily on Linux systems, advanced computing environments such as HPC , cyber security, file systems, data backup), concepts, theory, and practice ... and podamn. Linux KVM/QEMU/Libvirt based virtual host provisioning and management. HPC related software like SLURM, InfiniBand, environment modules (LMOD), parallel… more
    General Atomics (06/12/25)
    - Related Jobs
  • Senior Software Engineer, GPU Communications…

    NVIDIA (Santa Clara, CA)
    …communication runtimes for Deep Learning frameworks (eg NCCL for TensorFlow/Pytorch) and HPC programming interfaces (eg UCX for MPI/OpenSHMEM) on GPU clusters. + ... of high-performance networks like InfiniBand, iWARP etc. + Experience with HPC applications. + Experience with Deep Learning Frameworks such PyTorch, TensorFlow,… more
    NVIDIA (06/12/25)
    - Related Jobs
  • Senior Software Engineer, Infrastructure, Google…

    Google (Sunnyvale, CA)
    …as we continue to push technology forward. GCP is favored by AI/ML and HPC customers for its rich heritage of offerings including TPUs, GPUs, GKE for scheduling ... workloads, Vertex AI, integration with GCS object storage, etc. Customers want their Lustre file system to work seamlessly with all these technologies, and deliver the maximum performance. Come help us make Lustre shine on GCP and work cohesively with all… more
    Google (08/25/25)
    - Related Jobs