- NVIDIA (Santa Clara, CA)
- …how you can make a lasting impact on the world. We are seeking a highly-skilled Senior On- Device Model Inference Optimization Engineer to join our team ... you'll be doing: + Develop and implement strategies to optimize AI model inference for on- device deployment. + Employ techniques like pruning, quantization,… more
- Qualcomm (San Diego, CA)
- …+ Experience in LLM reasoning or inference acceleration research + Experience of on- device AI model production or exposure to mobile / edge device ... methods. **Responsibilities:** + Research and engineering on efficiency of LLM inference and decoding **, eg, speculative decoding, token-wise conditional computing,… more
- TP-Link North America, Inc. (Irvine, CA)
- …enable consumers to enjoy a seamless, effortless lifestyle. We are seeking a Senior AI/ML Computer Vision Engineer to drive the development and deployment of ... vacuum cleaners. This role is crucial for optimizing real-time machine learning inference and video analytics at the edge, ensuring seamless integration with cloud… more
- FocusKPI Inc. (Mountain View, CA)
- FocusKPI is looking for a Senior AI Web Development Engineer to join one...device (edge) LLM is a plus + ML model deployment and inference in the Android ... novel ways to use web content. They seek a Senior AI Web Development Engineer or Senior ...scraping frameworks like Puppeteer or a similar framework + On- device AI model optimization and quantization Desired… more