- Microsoft Corporation (New York, NY)
- …model architectures and training methods for Vision - Language (VLM) and Vision - Language - Action ( VLA ) models + Proactive, real-time agents for ... learning agent platforms. Some of our projects include work on small language / action models (eg, Phi, Orca, Fara-7B), new architectures and optimizers (eg,… more
- Meta (Redmond, WA)
- …theory, optimization algorithms, representation learning, self-supervised learning, multimodal learning, vision - language - action ( VLA ) models, ... architectures, planning and control algorithms that involve tactile and multimodal vision and perception, dexterous manipulation and collision avoidance 3. Develop… more
- NVIDIA (Seattle, WA)
- …, tactile, and force/torque sensing) + Simulation, sim-to-real, and real-to-sim + Vision - language - action ( VLA ) models, including architectural ... have been presented at top robotics, AI, and computer vision conferences; these works include BayesSim (https://www.roboticsproceedings.org/rss15/p29.pdf) , cuRobo… more
- NVIDIA (Seattle, WA)
- …, tactile, and force/torque sensing) + Simulation, sim-to-real, and real-to-sim + Vision - language - action ( VLA ) models, including architectural ... have been presented at top robotics, AI, and computer vision conferences; these works include BayesSim (https://www.roboticsproceedings.org/rss15/p29.pdf) , cuRobo… more
- NVIDIA (Santa Clara, CA)
- …robot learning, including imitation and reinforcement learning. + Hands-on experience training Vision - Language - Action ( VLA ) models. + Experience ... generating synthetic data for robotics applications. + Great communication and collaboration skills. Ways to stand out from the crowd: + Experience learning from human video demonstrations or human-object reconstruction. + Expertise in dexterous bimanual… more
- NVIDIA (Santa Clara, CA)
- …simulation, sim-to-real, real-to-sim + Motion and task planning Robot Learning and Reasoning + Vision language action ( VLA ) models + Foundation models ... for robotics + Imitation and reinforcement learning + Foundation models for 3D perception + Synthetic data generation Click here to learn more about NVIDIA, our early talent programs, benefits offered to students and other helpful student resources related to… more
- General Motors (San Francisco, CA)
- …high-level reasoning and physical execution. Your work will focus on advancing vision - language - action architectures to solve critical challenges in data ... autonomy and machine learning. **About the Role:** As a VLM/ VLA Research Intern on the AI Research team, you...+ Drive the development of embodied foundation models and vision - language - action architectures that unify multimodal… more
- Bosch (Sunnyvale, CA)
- …automation. + **Advance 3D perception capabilities** by integrating large-scale vision - language - action models, enhancing reasoning, explainability, and ... research, our AI research in Silicon Valley focuses on Foundation Models, Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data… more
- Meta (Redmond, WA)
- …against state-of-the-art approaches in world modeling, video generation, and vision - language - action model.-Leverage multimodal generation to accelerate ... AI in the following areas:-Develop unified predictive models that integrate language , vision , human motion, and actions.-Investigate techniques to enable… more
- NVIDIA (Santa Clara, CA)
- …etc. + Domain experience in current innovative deep learning methods (eg diffusion models, vision language action models, etc.) + Strong Python and/or C/C++ ... the world. Doing what's never been done before takes vision , innovation, and the world's best talent. As an...to run various classes of model architecture (Transformer, Diffusion, VLA , CNN, RNN etc.) on NVIDIA hardware leveraging techniques… more