- Meta (Olympia, WA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially ... workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look…
- Meta (Bellevue, WA)
- …of RDMA workloads that expects a loss-less fabric interconnect. To enhance the performance of these systems, we continuously seek opportunities for improvement ... host networking, communication libraries, and scheduling infrastructure. **Required Skills:** AI/HPC Network Engineer Responsibilities: 1. Design, develop, test…
- Meta (Bellevue, WA)
- …end-to-end system validation strategy (hardware and software), with a focus on various AI/HPC hardware systems in datacenter applications. 2. Lead the ... algorithms, and OOP). **Preferred Qualifications:** 17. Proficiency in High-Performance Computing (HPC) or AI system architecture…
- Amazon (Seattle, WA)
- …and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ... have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist…
- Amazon (Seattle, WA)
- …operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. AWS Infrastructure Services owns the design, planning, ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ... the current customer experience as well as developing improved systems for future designs. You will work directly with…
- Amazon (Seattle, WA)
- …are used to guarantee top performance of AWS ML and High Performance Computing (HPC) technologies developed by our organization. Bring your exceptional ... knowledge of CI/CD automation, ML and HPC benchmarks and applications to bear on the cutting-edge ... Join us as we expand the AWS offerings for AI, including Trainium, Graviton and the Elastic Fabric Adapter…
- Amazon (Seattle, WA)
- …are used to guarantee top performance of AWS ML and High Performance Computing (HPC) technologies developed by our organization. Bring your exceptional ... knowledge of CI/CD automation, ML and HPC benchmarks and applications to bear on the cutting-edge ... Join us as we expand the AWS offerings for AI, including Trainium, Neuron and the Elastic Fabric Adapter…
- Amazon (Seattle, WA)
- Description: We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... operations that enable AI to scale across multiple accelerators & servers. Most ... building networking solutions for Machine Learning (ML) and High-Performance Computing (HPC) workloads on AWS. We…
- NVIDIA (Redmond, WA)
- …workshops, etc. + Analyze and develop solutions for customer performance issues for both AI workload and systems performance. What we need to see: + ... networking and help develop accelerated computing networking solutions for AI/ML and HPC with our Hyperscaler customers. ... systems in general including but not limited to performance testing/tuning, benchmarking, etc. + Strong systems…
- Pacific Northwest National Laboratory (Richland, WA)
- …content (e.g., blogs, whitepapers, presentations). + Specialized technical/functional (e.g., Cloud/HPC computing, Security, AI) experience. Marketing Prowess + ... content (e.g., blogs, whitepapers, presentations). + Specialized technical functional (e.g., Cloud/HPC computing, Security, AI) experience. + Experience guiding…