- Google (San Jose, CA)
- …systems. + Work on building a generic framework for running evaluations for LLM generated content across the TV server stack. + Infrastructure to generate custom ... using LLMs. + Server infrastructure to support evals for Large Language Models ( LLM ) generated content and Agentic AI support on TV interfacing with Google agentic… more
- NVIDIA (Santa Clara, CA)
- …+ Contribute new features, fix bugs and deliver production code to TRT- LLM , NVIDIA's open-source inference serving library. + Profile and analyze bottlenecks across ... processor and system-level performance optimization. + Deep understanding of modern LLM architectures. + Strong fundamentals in algorithms. + GPU programming… more
- Amazon (Sunnyvale, CA)
- …of Smart Home experiences using the latest multimodal Large Language Models ( LLM 's) and Computer Vision. We are evolving Alexa into an intelligent, indispensable ... - Develop new inference and training techniques to improve the performance of LLM 's for Smart Home control and Automation - Solve hard problems in computer… more
- NVIDIA (Santa Clara, CA)
- …for Nsight tools. + Design, develop performance triages for upcoming and latest LLM chips. + Collaborate closely with the Hardware Architecture team to co-design and ... the software libraries. + Stay up-to-date with the latest advancements in LLM inference, hardware acceleration, and software optimization techniques. What we need to… more
- Meta (Menlo Park, CA)
- …Meta is seeking a Research Scientist to join our Llama Large Language Model ( LLM ) Research team. We are looking for recognized experts in VLLMs; with experience in ... **Preferred Qualifications:** Preferred Qualifications: 14. Direct experience in generative AI and LLM research. 15. Fluent in Python and PyTorch. 16. First author… more
- Meta (Menlo Park, CA)
- …Meta is seeking a Research Engineer to join our Llama Large Language Model ( LLM ) Research team. We are looking for recognized experts in VLLMs; with experience in ... 13. Industry research & development experiences in generative AI and LLM research. 14. Experience solving complex problems and comparing alternative solutions,… more
- Amazon (East Palo Alto, CA)
- …Transformers for vision-language modeling. - Hands-on experience in large-scale multimodal LLM and generative model training. Contributions to popular open-source ... LLM frameworks or research publications in top-tier AI conferences, such as CVPR, ECCV, ICCV, ICLR, etc. - Experience in GPU utilization and memory optimization… more
- NVIDIA (Santa Clara, CA)
- …systems and familiar with deep learning architectures and tools like NVIDIA TensorRT- LLM , Multimodal- LLM , and Triton Server. NVIDIA is widely considered to ... be one of the technology industry's most desirable employers. We have some of the most forward-thinking and hard-working people working with us and our engineering teams. If you're a creative engineer with a real passion for building scalable and robust… more
- Google (Mountain View, CA)
- …language models to public users. Our team develops and deploys new LLM -driven functionalities across Web, Android, iOS, and front-end server platforms, covering UI, ... architecture, and performance optimization + Experience developing and deploying user-facing LLM applications + Experience in user metrics logging and analysis +… more
- Ford Motor Company (Palo Alto, CA)
- …learning, and deep learning, to deliver robust and scalable solutions. 8. ** LLM Application & Innovation:** 9. Drive the exploration and implementation of Large ... data manipulation, analysis, and model development. + **Deep knowledge of LLM architectures and practical application frameworks** (eg, Hugging Face Transformers,… more