- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/ HPC ...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
- LTD Global (Berkeley, CA)
- …computing ( HPC ) and data analysis for the organization. Our center provides essential HPC and data systems to more than 10,000 researchers working in areas ... Position overview: We are seeking a Site Reliability Engineer to join our Operations Group. This role...part of a 24/7 operations team that ensures our systems are accessible, reliable, secure, and available to the… more
- SpaceX (Hawthorne, CA)
- …low-latency storage fabrics using RoCEv2 and/or InfiniBand for compute-intensive environments. + Engineer and support parallel file systems (eg, VAST, Lustre, ... IT Storage Engineer Hawthorne, CA Apply SpaceX was founded under...expertise in both enterprise storage platforms and high-performance computing ( HPC ) storage environments. This role is ideal for someone… more
- NVIDIA (Santa Clara, CA)
- …need to see: + 12+ years of experience in performance engineering, benchmarking, or HPC /AI systems . + Deep expertise in AI/ML and deep learning frameworks ... in search of a highly skilled Senior Storage Performance Engineer to join our ambitious team in Santa Clara,...we continue to push the boundaries of AI and HPC technologies. You will have the chance to create,… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **About SLAC:** The SLAC National ... hybrid work options.** **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing Services Division of the Technology and… more
- NVIDIA (Santa Clara, CA)
- …to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads ... be doing: + Design and develop enhancements to the HPC batch scheduler(s). + Work extensively with HPC...and tools + Accomplished in computer architecture and operating systems + Experience analyzing and tuning performance for a… more
- NVIDIA (Santa Clara, CA)
- …wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, ... implementing, and optimizing on-prem High-Performance Computing ( HPC ) storage solutions while harnessing the power of cloud computing. You will be responsible for… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
- Honeywell (San Jose, CA)
- As a **Lead IT Engineer for High Performance Computing ( HPC )** here at Honeywell, you will be at the forefront of our technology initiatives, driving the design ... in optimizing our computing resources and ensuring that our systems operate at peak efficiency. Honeywell's HPC ...our systems operate at peak efficiency. Honeywell's HPC infrastructure spans hundreds of computers in different regions… more
- Northrop Grumman (Los Angeles, CA)
- …ability to get cleared to SAP access level. **Basic Qualifications for a Principal Systems Engineer , Modeling and Simulation Systems /Software - (Level 03):** ... specific to modeling and simulation. **Basic Qualifications for an Engineer , Modeling and Simulation Systems /Software - (Level...running Monte Carlo simulations + Experience working with an HPC system + Experience with hardware in the loop… more