- Meta (Bellevue, WA)
- …host networking, communication libraries, and scheduling infrastructure. **Required Skills:** AI/ HPC Network Engineer Responsibilities: 1. Design, develop, test ... and operate networking systems to support large scale AI training jobs. 2. Establish and implement global best practices and contribute to the design of new scalable network solutions. 3. Research, develop and deploy numerous technologies and network… more
- Meta (Olympia, WA)
- …and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Active member of ... a multi-disciplinary team to develop solutions for large scale training systems. 2. Responsible for the overall performance of the communication system, including performance benchmarking, monitoring and troubleshooting production issues. 3. Identify potential… more
- Meta (Bellevue, WA)
- …AI workload needs.We are hiring in multiple locations. **Required Skills:** Software Engineer , Systems ML - HPC Specialist Responsibilities: 1. Apply relevant ... **Summary:** Meta is seeking an AI Software Engineer to join our Research & Development teams....on the web.Some aspects of this role as an HPC specialist may include authoring components such as cuBLAS,… more
- Amazon (Seattle, WA)
- Description The AWS High Performance Computing ( HPC ) team is looking for experienced SDE to work on a new HPC service. The HPC team is building a core set of ... that allow our customers to plan, schedule, and execute HPC workloads across the full range of AWS compute...different locations. This is an opportunity to operate and engineer systems on a global scale, while touching and… more
- Amazon (Seattle, WA)
- …of peer teams? We want to talk to you! We seek a Software Development Engineer for the Machine Learning (ML) Infrastructure team to build the tools that are used ... top performance of AWS ML and High Performance Computing ( HPC ) technologies developed by our organization. Bring your exceptional...Fabric Adapter (EFA). Key job responsibilities Be an autonomous engineer on a team that builds and maintains the… more
- Amazon (Seattle, WA)
- …of peer teams? We want to talk to you! We seek a Sr. Software Development Engineer for the Machine Learning (ML) Infrastructure team to build the tools that are used ... top performance of AWS ML and High Performance Computing ( HPC ) technologies developed by our organization. Bring your exceptional...Fabric Adapter (EFA). Key job responsibilities Be the lead engineer on a team that builds and maintains the… more
- Amazon (Seattle, WA)
- Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving...you like solving hard problems, want to work with HPC and ML customers, iterate fast and deliver meaningful… more
- Meta (Bellevue, WA)
- **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are the foundation ... and lifecycle of servers in production. **Required Skills:** Production Systems Engineer , Fleet AI Systems Lead Responsibilities: 1. Lead interfacing with external… more
- Meta (Bellevue, WA)
- **Summary:** Meta is seeking a Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are the foundation upon which ... and lifecycle of servers in production. **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: 1. Interface with external vendors and… more
- Amazon (Redmond, WA)
- …Qualifications - Experience working with ASIC teams and High-Performance Computing ( HPC ) environments - AWS certifications (eg, AWS Certified Solutions Architect, ... AWS Certified DevOps Engineer ) - Experience with container orchestration, monitoring tools, and database administration - Familiarity with incident management and… more
Recent Jobs
-
Senior Manager, Internal Audit
- West Pharmaceutical Services (Exton, PA)
-
Engineer Systems Architect 2
- Huntington Ingalls Industries (Wright Patterson AFB, OH)
-
Senior Project Controls Analyst
- TYLin (San Diego, CA)
-
Sr. Systems Safety Engineer
- Raytheon (Tucson, AZ)