- General Atomics (San Diego, CA)
- …wireless and laser technologies. We have an exciting opportunity for a Sr. HPC Systems and Storage Administrator to join our Magnetic Fusion ... affiliated companies, is one of the world's leading resources for high-technology systems development ranging from the nuclear fuel cycle to remotely piloted…
- SpaceX (Hawthorne, CA)
- Sr. HPC Systems Engineer...excel at this position. RESPONSIBILITIES: + Administer and manage HPC clusters, storage systems, and ... work extended hours and weekends as needed. COMPENSATION AND BENEFITS: Pay Range: Sr. HPC Systems Engineer: $160,000.00-$220,000.00/per year Your actual…
- NVIDIA (Santa Clara, CA)
- …play a crucial role in streamlining our testing processes. + Validation of distributed storage systems (e.g., Lustre) on AI/HPC datacenter-scale infrastructure ... best we can be. We are looking for a Senior Software Validation Engineer to lead software validation activities...Kubernetes, Docker containers & Jenkins pipelines + Certifications in storage (e.g., SNIA) or HPC systems…
- Amazon (Santa Clara, CA)
- …large analytical problems at massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... physical or life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems, underlying IT systems…
- Northrop Grumman (Redondo Beach, CA)
- …code deployment, maintenance, and optimization efforts. The lessons learned from existing HPC systems will inform the architecture, deployment, and utilization ... but are not limited to: + Develop and deploy architectures for future HPC systems based on engineering computing requirements, collaborating with engineering to…
- NVIDIA (Santa Clara, CA)
- …including InfiniBand, RDMA and RoCE. + Understanding of fast, distributed storage systems such as Lustre and GPFS for AI/HPC workload. + Familiarity with ... doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage…
- NVIDIA (Santa Clara, CA)
- …systems teams to ensure smooth integration of networking, processing, and storage systems. + Partner with customers and ecosystem partners to co-innovate, ... the full potential of NVIDIA GPUs, DPUs, compute and storage servers through high-bandwidth, low-latency fabrics. + Stay at...Published work, patents, or advanced certifications in networking or HPC systems. NVIDIA is widely considered to…
- NVIDIA (Santa Clara, CA)
- …with InfiniBand with IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads + Familiarity with deep ... doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage…
- NVIDIA (Santa Clara, CA)
- …RDMA, RoCE and Amazon EFA. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workload. Experience working with deep ... doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage…
- NVIDIA (Santa Clara, CA)
- …Observability is at the heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to design and build the next-generation ... Our technology powers everything from generative AI to autonomous systems, and we continue to shape the future of...metrics, logs, traces, and events for GPU-powered AI and HPC workloads. + Build large-scale telemetry data pipelines leveraging…