- Electric Power Research Institute (Palo Alto, CA)
- **Job Title:** Director, Agentic AI Initiatives & Distributed AI Architecture **Location:** Charlotte, NC, Knoxville, TN, Palo Alto, CA **Job Summary and ... Description:** **Director, Agentic AI Initiatives & Distributed AI Architecture** **Position Purpose** EPRI is seeking a visionary Director to develop and lead two… more
- Google (Sunnyvale, CA)
- Senior Software Engineer, Google Distributed Cloud, Kubernetes _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving ... and 1 year of experience with software design and architecture for distributed systems. **Preferred qualifications:** + Master's degree or PhD in Computer Science… more
- Amazon (Cupertino, CA)
- …well as Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a… more
- DoorDash (San Francisco, CA)
- …for DoorDash Engineering. The Cassandra team under Storage is looking for passionate distributed systems engineers to architect and scale the next evolution of our ... and efficiency. You will help us bootstrap and scale our internal distributed database infrastructure centered around Cassandra, with a focus on reliability,… more
- Amazon (Cupertino, CA)
- …as well as stable diffusion, Vision Transformers and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large...using Python is a must. FSDP, Deepspeed and other distributed training libraries are central to this and extending… more
- Nutanix (San Jose, CA)
- …Opportunity** Are you a passionate software engineer eager to tackle complex distributed systems challenges, skilled in modern programming languages like Golang and ... migration using Golang and Python. + Enhance the resiliency and availability of distributed system services within the AHV control plane. + Optimize algorithms for… more
- Google (Sunnyvale, CA)
- Leadership Program Manager, Google Distributed Cloud _corporate_fare_ Google _place_ Sunnyvale, CA, USA; Kirkland, WA, USA **Advanced** Experience owning outcomes ... developing formal supply chain risk mitigation frameworks. + Experience supporting distributed , or hybrid cloud hardware deployment models. + Experience in managing… more
- Amazon (East Palo Alto, CA)
- Description Amazon Aurora DSQL is a serverless, distributed SQL database with virtually unlimited scale, highest availability, and zero infrastructure management. ... to lead research and development in advanced query optimization techniques for distributed sql services. You will innovate in the query planning and execution… more
- Amazon (Cupertino, CA)
- …that use them. As the SDM of Software Development for the Machine Learning Distributed Training team, you will be responsible for leading a strong team of engineers ... for inference and training support in Pytorch, XLA, JAX as well as distributed training libraries like FSDP, DDP and others. Includes enabling models using MoE… more
- Google (Sunnyvale, CA)
- Site Reliability Engineer, Google Distributed Cloud, Connected SRE _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes and ... projects. + 3 years of experience designing, analyzing, and troubleshooting distributed systems. **Preferred qualifications:** + Master's degree in Computer Science… more