- Amazon (East Palo Alto, CA)
- …massive datasets. - Develop robust monitoring and debugging tools to ensure the reliability and performance of training workflows on large GPU clusters. Design and ... operations. - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience. - Proficient in… more
- Amazon (Sunnyvale, CA)
- …Kuiper and Government customer performance needs including availability, reliability , upgradeability, interoperability, and security requirements. - Effectively ... language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a… more
- Howmet Aerospace (Fontana, CA)
- …the Maintenance and Facilities Manager in order to enhance equipment reliability and to develop facilities improvements. ESSENTIAL DUTIES AND RESPONSIBILITIES: This ... support equipment to streamline production and maintenance techniques. + Assist with on site repairs as required. + Complete special work as assigned. + Other duties… more
- Amazon (El Segundo, CA)
- …Kuiper and US Government performance needs including availability, reliability , upgradeability, interoperability, and security requirements - Effectively communicate ... experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience… more
- Amazon (East Palo Alto, CA)
- …deliver and maintain large scale features * Enhance the architecture, scalability, reliability , and performance of the system * Provide mentorship and support to ... language experience - 4+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a… more
- Amazon (Sunnyvale, CA)
- …language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a ... - Experience contributing to the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems - Experience leading and… more
- Amazon (Cupertino, CA)
- …systems that validate hardware quality in manufacturing; monitoring and improving hardware reliability in data centers and platform. We cover everything from low ... while taking into consideration our customer needs from a cost, performance, and reliability perspective. About the team Why AWS Amazon Web Services (AWS) is the… more
- Amazon (Cupertino, CA)
- …systems that validate hardware quality in manufacturing; monitoring and improving hardware reliability in data centers and platform. We cover everything from low ... while taking into consideration our customer needs from a cost, performance, and reliability perspective. About the team Within AWS AWS Board Core Design & Services… more
- Amazon (Culver City, CA)
- …the right solutions by working on the following: * Design (design patterns, reliability and scaling) and maintenance of new and existing systems. * Scalable backend ... experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience… more
- Amazon (Cupertino, CA)
- …software development experience - 3+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience - 4+ years ... Scrum methodology - 5+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience - Experience automating… more