- Amazon (Seattle, WA)
- Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Want to ... delivering continuous price performance improvements in the cloud for AI model training for multi billion variable LLMs? Come...the current customer experience as well as developing improved systems for future designs. You will work directly with… more
- Amazon (Seattle, WA)
- Description As part of the Applied AI Solutions organization, we have a vision to provide business applications that are used by millions of companies worldwide to ... of the art in computer vision, machine learning, distributed systems and hardware design. As an experienced Hardware Design... and hardware design. As an experienced Hardware Design Engineer within our team, you will engage with a… more
- Amazon (Seattle, WA)
- …experience with PyTorch or Jax - preferably involving developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware. Amazon ... cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible… more
- Amazon (Seattle, WA)
- …instance types). We solve systemic hardware issues and we build hardware and software systems to detect and mitigate future recurrences so that our our customers can ... other teams. You will be responsible for hardware and systems that improve how we detect, root cause, and...you develop into a better-rounded professional. The Hardware Engineering AI / ML development team is a group of… more
- Amazon (Seattle, WA)
- …cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible ... design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Programming proficiency in Python or C++ (at least one… more
- Meta (Redmond, WA)
- …their software development skills to reliably introduce them at scale in production . **Required Skills:** Production Network Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...conceiving design solutions, developing, testing and deploying network software, systems , and tools that keep the Data Center network… more
- Meta (Olympia, WA)
- … workloads that power new Meta products. **Required Skills:** Network Production Engineer , Network Infrastructure Responsibilities: 1. Conceiving, developing, ... Meta products and experiences, and we are looking for Production Engineers who are interested in solving complex technical...and deploying systems and tools to keep the network running reliably… more
- Meta (Seattle, WA)
- …daily - solving problems at a scale few others face. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services which handle fleet ... engineers in the industry, you'll contribute to code and systems that go into production and are...Meta Ads, infrastructure components that drive Meta's advances in AI , core services which are used by every team… more
- Meta (Olympia, WA)
- …software codesign for AI domain specific problems. **Required Skills:** Software Engineer , Systems ML - Frameworks / Compilers / Kernels Responsibilities: 1. ... to accelerate the next generation of deep learning models such as Recommendation systems , Generative AI , Computer vision, NLP etc 5. Performance tuning and… more
- MongoDB (Seattle, WA)
- …**Candidates must have:** + Experience leading engineering teams building and operating production systems + 5+ years software engineering experience, primarily ... management with deep technical expertise to solve complex distributed systems problems at scale. As the Lead Engineer...product roadmap + Drive incident response and postmortems for production storage systems We are hiring for… more