- SpaceX (Hawthorne, CA)
- IT Storage Engineer Hawthorne, CA Apply SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more ... ultimate goal of enabling human life on Mars. IT STORAGE ENGINEER SpaceX is seeking an experienced...Engineer and support parallel file systems (eg, VAST, Lustre , BeeGFS) for high concurrency workloads in simulation and… more
- NVIDIA (Santa Clara, CA)
- …wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, ... with storage protocols such as nfs, NVMe/TCP, S3 and Lustre (LNet) + Experience with containerization technologies like Kubernetes and their integration… more
- NVIDIA (Santa Clara, CA)
- …guide us to be the best we can be. We are looking for a Senior Software Validation Engineer to lead software validation activities in the Datacenter Systems ... Engineering team. You'll work closely with solution architects, HW system engineers, Software , Network & Storage architects, validation engineers, OEM/ODMs, and… more
- NVIDIA (Santa Clara, CA)
- …+ Familiarity with InfiniBand with IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads + Familiarity ... We are seeking a Senior AI/ML Performance and Efficiency Engineer , GPU Clusters at NVIDIA to join our AI...organizations to deliver efficiency in our usage of hardware, software , and infrastructure + Proactively monitor fleet wide utilization… more
- NVIDIA (Santa Clara, CA)
- …pertaining to HPC including InfiniBand, RDMA and RoCE. + Understanding of fast, distributed storage systems such as Lustre and GPFS for AI/HPC workload. + ... on the world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for EDA and high-performance… more
- NVIDIA (Santa Clara, CA)
- …including InfiniBand, RDMA, RoCE and Amazon EFA. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workload. Experience ... large-scale HPC systems including the deployment of compute, networking, and storage . + Develop and improve our ecosystem around GPU-accelerated computing including… more