- Amazon (Seattle, WA)
- Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Want to ... delivering continuous price performance improvements in the cloud for AI model training for multi billion variable LLMs? Come...who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking,… more
- Amazon (Seattle, WA)
- Description Do you want to shape the future of Generative AI at AWS? Join the team building the foundation of the world's most advanced cloud for AI training and ... deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing the limits… more
- Amazon (Seattle, WA)
- …Join the team building the foundation of the world's most advanced cloud for AI training and inference - where multi-billion-parameter models come to life at scale. ... and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing...of the Hardware Engineering team you will own and lead the design, development and root cause of a… more
- Amazon (Seattle, WA)
- Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Want to ... delivering continuous price performance improvements in the cloud for AI model training for multi billion variable LLMs? Come...team in this specific function, you will own and lead the design, development and root cause of a… more
- Oracle (Olympia, WA)
- …components of Oracle's Cloud Infrastructure. You should be both a rock-solid lead developer, curious problem solver, a distributed systems generalist and/or skilled ... Linux engineer with Systems triage experience able to dive deep...excited to learn. This role resides within the Compute AI Infrastructure Bare Metal Provisioning team, which owns the… more
- Amazon (Bellevue, WA)
- …of our AI solutions by maintaining the highest standards of data quality throughout the development process while building capability within the broader team. ... business needs** Embark on a transformative journey as our Sr. Domain Expert Lead , where intellectual rigor meets technological innovation. As a Sr. Domain Expert … more
- Amazon (Seattle, WA)
- …words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ... we detect, root cause, and remediate issues. You will lead cross functional investigations and define changes needed to...and manufacturing partners to bring these servers to the data center. After launch you will oversee the fleet… more
- Amazon (Seattle, WA)
- …Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distribute Training team for AWS Neuron, responsible for development, ... with training these large models using Python is a must. FSDP (Fully-Sharded Data Parallel), Deepspeed, Nemo and other distributed training libraries are central to… more
- Amazon (Seattle, WA)
- …Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distribute Training team for AWS Neuron, responsible for development, ... Distributed training with awareness of strategies like FSDP (Fully-Sharded Data Parallel), PP, Context parallel. Distributed training libraries like torchtitan,… more
- Microsoft Corporation (Redmond, WA)
- …data engineering standards that enable reliable, secure, and scalable analytics for AI -driven products. Additionally, the Data Engineer will deeply ... flow across existing and new products, differentiate human vs. AI Agent generated data , lead ...data models, and prepare design specifications for building data pipelines. The Data engineer … more