- NVIDIA (Santa Clara, CA)
- …or equivalent technologies + Experience with domain-specific language design and compiler optimizations, in particular sparse compilers (MLIR or TACO) + Excellent ... in AI and HPC + Good understanding of LLMs, Deep Learning methods and frameworks + Experience with low-level...parallel computing for science and engineering. More recently, GPU deep learning ignited modern AI - the next era… more
- NVIDIA (Santa Clara, CA)
- …coding (C++ and Python), analytical, and debugging + Good understanding of Deep Learning frameworks like PyTorch and TensorFlow, distributed training and inference. ... analysis for training/inference workload + Knowledge of Linux device drivers and/or compiler implementation + Knowledge of GPU and/or CPU architecture and general… more
- Amazon (Cupertino, CA)
- …accelerators and the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS ... Web Services (AWS) is looking for a Software Development Engineer II to build, deliver, and maintain complex products...Training team works side by side with chip architects, compiler engineers and runtime engineers to create , build… more
- Palo Alto Networks (Santa Clara, CA)
- …third-party libraries. 2. Performance & Security + **Performance Optimization:** Drive deep technical optimizations at the ** compiler and architectural levels** ... Make, Webpack). + **Strong systems-level programming skills** and experience with compiler -level optimization and performance profiling. + ** Deep experience with… more
- quadric.io, Inc (Burlingame, CA)
- …for different hardware configurations; This senior technical role demands deep knowledge of hardware architecture, compiler toolchain and optimization ... C++ DSP and control code. Role: The AI Kernel Engineer in Quadric plays the key role to enable...maximize hardware utilization + Make Improvement to Quadric toolchain, compiler and runtime + Provide technical support and documents… more
- Amazon (Cupertino, CA)
- …Inferentia (Inf1/Inf2) our cloud-scale Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distribute Training team for AWS ... The ML Distributed Training team works side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training… more
- Amazon (Cupertino, CA)
- …and the Trn1 and Inf1 servers that use them. As the Software Development Engineer for the Neuron Foundation Tools Team, you will be responsible for working alongside ... hardware platforms such as Trainium and Inferentia devices, by providing deep insights into performance bottlenecks and system behavior. Improving performance of… more