- SpaceX (Hawthorne, CA)
- Sr. HPC Systems Engineer (Top Secret Clearance) Hawthorne, CA Apply SpaceX was founded under the belief that a future where humanity is out exploring the ... SYSTEMS ENGINEER (TOP SECRET CLEARANCE) SpaceX is looking for an HPC Systems Engineer with strong knowledge and experience in a world class… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Lead ... expect a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across stack: network… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Active ... of RDMA workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer . Do you want to be part of a team that brings new Artificial ... center GPU server and networking system deployments as Solution Architect Engineer . Guide customer discussions on network design, compute/storage and support bring… more
- Meta (Menlo Park, CA)
- …evolving AI workload needs.We are hiring in multiple locations. **Required Skills:** Software Engineer , Systems ML - HPC Specialist Responsibilities: 1. ... **Summary:** Meta is seeking an AI Software Engineer to join our Research & Development teams....on the web.Some aspects of this role as an HPC specialist may include authoring components such as cuBLAS,… more
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/ HPC ...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
- NVIDIA (Santa Clara, CA)
- …Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... implementation of distributed storage services. + Design, implement an on-prem AI/ HPC infrastructure supplemented with cloud computing to support the growing needs… more
- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced engineer to work on distributed AI/ML systems . This role involves working on collective operations - the fundamental ... kernels, and performant code is important. Experience with embedded systems is valued, and experience with high-speed networking or... is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving… more
- NVIDIA (Santa Clara, CA)
- …efficiency, and performance and drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are ... responsible for the big picture of how our systems relate to each other, we use a breadth...and support workload and resource schedulers in a large-scale HPC environment. + Automate Everything: Develop automation scripts to… more