- TP-Link North America, Inc. (Irvine, CA)
- …to enjoy a seamless, effortless lifestyle. OVERVIEW As a Senior Cloud Engineer - Distributed Database & Middleware, you will design and optimize the architecture of ... distributed database and middleware systems to ensure scalability, high...RESPONSIBILITIES: Architecture Design + Design the overall architecture of distributed database and middleware systems to ensure high availability,… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking a Senior Software Engineer to develop distributed storage services for AI/ML. The goal is to craft a reliable, scalable, and efficient ... looking for an engineer with a deep understanding of distributed systems, outstanding design skills, and a track record...and a track record in building and delivering large-scale distributed services. What you will be doing: + Leading… more
- Cisco (San Jose, CA)
- Senior Distributed Golang Software Engineer, Isovalent Tetragon Team (US) Apply (https://jobs.cisco.com/jobs/Login?projectId=1444334) + Location:Offsite, San Jose, ... incubating next-generation network security services. Our team builds highly available distributed systems that power cloud firewalls, Web proxies, Zero Trust… more
- Amazon (Cupertino, CA)
- …Machine Learning Engineer on one of our AWS Neuron teams: - The ML Distributed Training team works side by side with chip architects, compiler engineers and runtime ... engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a must. FSDP… more
- NVIDIA (Santa Clara, CA)
- …on efficient, scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving ... applications. Our team is addressing the most challenging issues in distributed AI infrastructure, and we're searching for engineers enthusiastic about building… more
- Amazon (Cupertino, CA)
- …as well as stable diffusion, Vision Transformers and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large...using Python is a must. FSDP, Deepspeed and other distributed training libraries are central to this and extending… more
- DoorDash (San Francisco, CA)
- …for DoorDash Engineering. The Cassandra team under Storage is looking for passionate distributed systems engineers to architect and scale the next evolution of our ... and efficiency. You will help us bootstrap and scale our internal distributed database infrastructure centered around Cassandra, with a focus on reliability,… more
- Amazon (Cupertino, CA)
- …well as Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a… more
- Amazon (Cupertino, CA)
- …that use them. As the SDM of Software Development for the Machine Learning Distributed Training team, you will be responsible for leading a strong team of engineers ... for inference and training support in Pytorch, XLA, JAX as well as distributed training libraries like FSDP, DDP and others. Includes enabling models using MoE… more
- Confluent (Sacramento, CA)
- …a great developer experience. You will have an opportunity to solve complex distributed systems problems at scale. You will build services that can operate across ... it. The WarpStream team is fully remote and geographically distributed , currently we have engineers in: Spain, France, Canada,...C++, Python, etc + Deep curiosity and enthusiasm for distributed systems and storage systems + Strong focus on… more