• Senior Researcher - Foundations of Generative AI-…

    Microsoft Corporation (New York, NY)
    …model architectures and training methods for Vision - Language (VLM) and Vision - Language - Action ( VLA ) models + Proactive, real-time agents for ... learning agent platforms. Some of our projects include work on small language / action models (eg, Phi, Orca, Fara-7B), new architectures and optimizers (eg,… more
    Microsoft Corporation (12/17/25)
    - Related Jobs
  • Research Scientist Intern, Robotics Dexterous…

    Meta (Redmond, WA)
    …theory, optimization algorithms, representation learning, self-supervised learning, multimodal learning, vision - language - action ( VLA ) models, ... architectures, planning and control algorithms that involve tactile and multimodal vision and perception, dexterous manipulation and collision avoidance 3. Develop… more
    Meta (12/20/25)
    - Related Jobs
  • Research Scientist, Robotics Research - PhD New…

    NVIDIA (Seattle, WA)
    …, tactile, and force/torque sensing) + Simulation, sim-to-real, and real-to-sim + Vision - language - action ( VLA ) models, including architectural ... have been presented at top robotics, AI, and computer vision conferences; these works include BayesSim (https://www.roboticsproceedings.org/rss15/p29.pdf) , cuRobo… more
    NVIDIA (10/25/25)
    - Related Jobs
  • Senior Robotics Research Scientist

    NVIDIA (Seattle, WA)
    …, tactile, and force/torque sensing) + Simulation, sim-to-real, and real-to-sim + Vision - language - action ( VLA ) models, including architectural ... have been presented at top robotics, AI, and computer vision conferences; these works include BayesSim (https://www.roboticsproceedings.org/rss15/p29.pdf) , cuRobo… more
    NVIDIA (10/23/25)
    - Related Jobs
  • Senior Machine Learning Engineer - Humanoid…

    NVIDIA (Santa Clara, CA)
    …robot learning, including imitation and reinforcement learning. + Hands-on experience training Vision - Language - Action ( VLA ) models. + Experience ... generating synthetic data for robotics applications. + Great communication and collaboration skills. Ways to stand out from the crowd: + Experience learning from human video demonstrations or human-object reconstruction. + Expertise in dexterous bimanual… more
    NVIDIA (01/10/26)
    - Related Jobs
  • Nvidia 2026 Internships: PhD Robotics Research…

    NVIDIA (Santa Clara, CA)
    …simulation, sim-to-real, real-to-sim + Motion and task planning Robot Learning and Reasoning + Vision language action ( VLA ) models + Foundation models ... for robotics + Imitation and reinforcement learning + Foundation models for 3D perception + Synthetic data generation Click here to learn more about NVIDIA, our early talent programs, benefits offered to students and other helpful student resources related to… more
    NVIDIA (01/10/26)
    - Related Jobs
  • Summer Intern - AI/ML Intern - Vision

    General Motors (San Francisco, CA)
    …high-level reasoning and physical execution. Your work will focus on advancing vision - language - action architectures to solve critical challenges in data ... autonomy and machine learning. **About the Role:** As a VLM/ VLA Research Intern on the AI Research team, you...+ Drive the development of embodied foundation models and vision - language - action architectures that unify multimodal… more
    General Motors (01/07/26)
    - Related Jobs
  • Lead Research Scientist- GenAI for 3D Computer…

    Bosch (Sunnyvale, CA)
    …automation. + **Advance 3D perception capabilities** by integrating large-scale vision - language - action models, enhancing reasoning, explainability, and ... research, our AI research in Silicon Valley focuses on Foundation Models, Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data… more
    Bosch (01/01/26)
    - Related Jobs
  • Research Scientist Intern, Multimodal Generative…

    Meta (Redmond, WA)
    …against state-of-the-art approaches in world modeling, video generation, and vision - language - action model.-Leverage multimodal generation to accelerate ... AI in the following areas:-Develop unified predictive models that integrate language , vision , human motion, and actions.-Investigate techniques to enable… more
    Meta (12/20/25)
    - Related Jobs
  • Senior Software Engineer, Deep Learning…

    NVIDIA (Santa Clara, CA)
    …etc. + Domain experience in current innovative deep learning methods (eg diffusion models, vision language action models, etc.) + Strong Python and/or C/C++ ... the world. Doing what's never been done before takes vision , innovation, and the world's best talent. As an...to run various classes of model architecture (Transformer, Diffusion, VLA , CNN, RNN etc.) on NVIDIA hardware leveraging techniques… more
    NVIDIA (01/10/26)
    - Related Jobs