- Amazon (Sunnyvale, CA)
- …can run efficiently on resource-constrained devices. Currently, we enable production ML models across multiple device families, including Echo, Ring/Blink, ... language, and multimodal tasks. Crucially, you need to be a specialist in hardware-aware quantization, with hands-on experience in model compression techniques… more