- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Lead ... deal with on a daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like GPUs together. In addition, we… more
- Meta (Sacramento, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Active ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: … more
- Amazon (Cupertino, CA)
- Description Do you like to use network and Unix systems engineering to deliver simple, sustainable, and repeatable solutions? Would you like to play a key role ... to own them to completion. The Core Networking team is looking for a Network Development Engineer to join our Network Fabric Engineering (NFE) team. As a … more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer . Do you want to be part of a team that brings new Artificial ... GPU server and networking system deployments as Solution Architect Engineer . Guide customer discussions on network design,...+ Demonstrate subject matter expertise in advanced GPU & network systems and be a trusted technical… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are the ... and life cycle of servers in production. **Required Skills:** Production Systems Engineer , Sustaining Responsibilities: 1. Develop robust, industry leading… more
- Meta (Menlo Park, CA)
- …Qualifications:** Preferred Qualifications: 11. Full-stack experience and understanding of AI/ HPC systems , from HW/infrastructure through the application layer, ... in some of the world's largest scale clusters. **Required Skills:** Software Engineer , Accelerator Systems & Technologies Responsibilities: 1. Understand and… more
- LinkedIn (Sunnyvale, CA)
- …networks. We develop tools and automate processes to support our hypergrowth. As a Staff Network Engineer , you'll play a pivotal role as a technical leader and ... balances competing priorities. In addition to leadership acumen, a successful Staff Network Engineer should demonstrate sufficient proficiency in both networking… more
- Qualcomm (San Diego, CA)
- …Technology Group > IT Networking **General Summary:** Join our dynamic Data Center Network team and be at the forefront of crafting our next-generation hyper-scale ... network infrastructure. We're on the lookout for passionate innovators...for passionate innovators who thrive on building sophisticated, large-scale systems designed for peak performance and unwavering reliability. With… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI/ML initiatives supporting large scale AI Training and ... to hyperscalar bring up and validation. **Required Skills:** Hardware Systems Engineer , NPI AI Lead Responsibilities: 1....rack level and at scale, as well as debugging AI/ HPC systems , performance optimizations, including familiarity with… more
Recent Jobs
-
Senior Test Analyst
- Raymond James Financial, Inc. (St. Petersburg, FL)
-
RCC Training Coordinator
- Seventh Dimension (Macdill AFB, FL)
-
Child Safety Content Moderation Analyst
- US Tech Solutions (Austin, TX)