- Meta (Menlo Park, CA)
- …Training, GPU architecture, ML systems, AI infrastructure, high performance computing, performance optimizations, or Machine Learning frameworks (e.g., PyTorch) ... Tensor Parallel, and Pipeline Parallel 12. Experience in HPC and parallel computing 13. Experience working with DL frameworks like PyTorch, Caffe2 or TensorFlow…
- Meta (Menlo Park, CA)
- …forwarding functions 7. Enhance HPC collective communication and parallel computing libraries (NCCL, RCCL, OneCCL, MPI) **Minimum Qualifications:** Minimum ... version 2 (RoCEv2) 16. QEMU or an FPGA emulation environment is a plus 17. Parallel computing platforms such as CUDA, ROCm and OpenCL 18. Experience with one of Platform…
- Meta (Menlo Park, CA)
- …Training, GPU architecture, ML systems, AI infrastructure, high performance computing, performance optimizations, or Machine Learning frameworks (e.g., PyTorch) ... (FSDP), Tensor Parallel, and Pipeline Parallel 9. Experience in HPC and parallel computing 10. Knowledge of ML, deep learning and LLM 11. Experience with NCCL…
- Stanford University (Stanford, CA)
- …and implement new features and technologies, and integrate them into the computing environment. * Follow team software development methodology. * Mentor lower level ... back end. * Fluency in SQL, Python and R * Familiarity with the cloud computing paradigm and platforms like Google Cloud or Azure * Experience with containerization…
- University of Southern California (Los Angeles, CA)
- …of modern data center technologies (e.g., virtualization, containerization, cloud computing, disaster recovery). + Technical expertise necessary to deliver enabling…
- Meta (Menlo Park, CA)
- …ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing, performance optimizations, or Machine Learning frameworks (e.g., PyTorch) 5. ... large-scale distributed deep learning models 10. Experience in HPC and parallel computing 11. Knowledge of GPU architectures and CUDA programming 12. Knowledge of…
- Google (Sunnyvale, CA)
- …empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that ... From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI…
- Oracle (Redwood City, CA)
- **Job Description** **Senior Software Developer for High Availability Cloud Computing** Oracle Data Guard is the industry leader for enterprise data protection and ... complex problems under limited supervision. Our projects are driven by our Cloud Computing and customer needs as well as innovative ideas that percolate from the…
- Google (San Francisco, CA)
- …empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that ... From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI…
- Stanford University (Stanford, CA)
- …in leading teams to apply data science, machine learning, and other advanced computing techniques to solve complex research problems. + Expertise in data science, ... and staying abreast of the latest technologies and methodologies in research computing. + Service-oriented leadership style, with an emphasis on supporting and…