- Amazon (Cupertino, CA)
- …between ML frameworks and hardware acceleration, while building strong foundations in distributed systems . We're looking for someone with solid programming ... accelerators. This role is for a Senior Machine Learning Engineer in the Distribute Training team for AWS Neuron,...Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip… more
- Amazon (Cupertino, CA)
- …Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distribute Training team for AWS Neuron, responsible for development, ... Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip...(design patterns, reliability and scaling) of new and existing systems experience - 5+ years of full software development… more
- Amazon (Cupertino, CA)
- …Machine Learning accelerators. This role is for a senior machine learning engineer in the Distribute Training team for AWS Neuron, responsible for development, ... Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip...(design patterns, reliability and scaling) of new and existing systems experience - 5+ years of full software development… more
- Google (Sunnyvale, CA)
- …setting. + 2 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, ... and Kubernetes (K8s) on cloud. + Experience with or managing storage systems . Google Cloud's software engineers develop the next-generation technologies that change… more
- Google (Sunnyvale, CA)
- …design. + 7 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, ... bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage,...on and is growing every day. As a software engineer , you will work on a specific project critical… more
- Google (Sunnyvale, CA)
- …C, C++, Go. + 2 years of experience with developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage ... Science or related technical fields. + Experience with development of low-level systems software, Linux Kernel, networking stack, Crypto APIs etc. Google's software… more
- NVIDIA (Santa Clara, CA)
- …at scale! We are seeking a highly technical and creative Senior Technical Marketing Engineer to join our team to showcase the innovations that power the training of ... world's largest AI models. This role will focus on distributed AI model training, ensuring that customers and partners...7+ years of experience in deep learning engineering, HPC systems , AI infrastructure, or technical evangelism roles. + Strong… more
- Google (Sunnyvale, CA)
- …design. + 7 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, ... bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage,...on and is growing every day. As a software engineer , you will work on a specific project critical… more
- Amazon (Cupertino, CA)
- …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is ... and runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large...(design patterns, reliability and scaling) of new and existing systems experience - - 5+ years of full software… more
- Amazon (Cupertino, CA)
- …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is ... and runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large...(design patterns, reliability and scaling) of new and existing systems experience - - 5+ years of full software… more