- Amazon (Seattle, WA)
- …compiler engineers and runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large models using Python ... is a must. FSDP, Deepspeed and other distributed training libraries are central to this and extending...(design patterns, reliability and scaling) of new and existing systems experience - - 5+ years of full software… more
- Google (Seattle, WA)
- …of the following: web or mobile application development, Unix/Linux environments, distributed and parallel systems , machine learning, information retrieval, ... natural language processing, networking, developing large software systems , or security software development. + Experience developing accessible technologies. +… more
- Microsoft Corporation (Redmond, WA)
- …years of involvement in designing, developing and building large scale distributed backend systems including operations, performance, reliability, resilience, ... The team operates at the intersection of software and hardware, building systems that validate, track, and manage hardware throughout its lifecycle-from arrival to… more
- Warner Bros. Discovery (Bellevue, WA)
- …+ Experience in consumer-facing applications (growth, fraud, personalization) + Familiarity with distributed computing systems (Spark, Ray) + Publications in ML ... combine strong technical skills with a passion for learning to build scalable ML systems that drive business impact. This role requires a **blend of strong research… more
- Amazon (Seattle, WA)
- …test of models in the production system. - Design and maintain large-scale distributed training systems to support multi-modal foundation models. - Optimize GPU ... or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience programming with at least one software programming… more
- Microsoft Corporation (Redmond, WA)
- …approaches. + Knowledge of PII detection, data privacy, fairness, or compliance in AI systems . + Familiarity with distributed data systems (eg, Spark, ... Databricks, Azure Data Lake). + Strong collaboration skills with engineers, TPMs, and product partners across multiple orgs. Applied Sciences IC4 - The typical base pay range for this role across the US is USD $119,800 - $234,700 per year. There is a different… more
- Qualtrics (Seattle, WA)
- …Collaborate with the team to enhance testing frameworks and tools for complex, distributed SaaS systems , aiming to improve automation and testing reliability. + ... exploring how we can leverage AI and ML to improve our customer experience in using our systems + We strive to improve how we provide value to our end users. + We… more
- Teradata (Olympia, WA)
- …Proven ability to drive strategy and execution in database & data systems , distributed workload processing, and cloud-native architectures. + Hands-on knowledge ... bring:** + Deep technical expertise in query optimization, execution engines, distributed databases, and elastic scaling models. + Familiarity with vectorized query… more
- SHI (Olympia, WA)
- …to improve our platform. + Contribute to the architecture and design of distributed , cloud-native systems . **Skill Level Requirements** + Bachelor's degree in ... especially Microsoft Azure or AWS. + Familiarity with event-driven and distributed system architectures. + Understanding of DevOps practices, CI/CD pipelines, and… more
- SHI (Olympia, WA)
- …new ideas to the team. + Support the architecture and design of distributed , cloud-native systems in collaboration with senior engineers. **Skill Level ... especially Microsoft Azure or AWS. + Familiarity with event-driven and distributed system architectures. + Understanding of DevOps practices, CI/CD pipelines, and… more