- 
        Research Scientist Intern, Multimodal Generative…
- Meta (Redmond, WA)
- 
             Summary: The Meta Reality Labs Research Team brings together a world-class team of researchers, developers, and engineers to create the future of contextual AI and robotics. The Surreal Vision group at RL Research is seeking exceptional Research Scientists to research and help build the egocentric machine perception functionalities that will underpin future contextual AI-enabled devices. The research intern will work on cutting edge research problems to innovate novel computer vision and machine learning techniques.Work with researchers to advance frontier generative AI in the following areas:-Develop unified predictive models that integrate language, vision, human motion, and actions.-Investigate techniques to enable long-horizon, consistent and physically grounded generation. -Benchmark against state-of-the-art approaches in world modeling, video generation, and vision–language–action model.-Leverage multimodal generation to accelerate robot learning and control.Build contextual and embodied AI models using large-scale egocentric multimodal datasets.Our internships are twelve (12) to twenty four (24) weeks long and we have various start dates throughout the year. Some projects may require a minimum of 24 consecutive weeks. Required Skills: Research Scientist Intern, Multimodal Generative AI and Robotics (PhD) Responsibilities: 1. Plan and execute cutting-edge research and development to advance the state-of-the-art in machine learning and large-scale training. 2. Collaborate with other researchers and engineers across machine perception teams at Meta to develop experiments, prototypes, and concepts that advance the state-of-the-art contextual AI and robotic systems. 3. Work with the team to help design, setup, and run practical experiments and prototype systems related to large-scale high-quality sensing and machine reasoning. Minimum Qualifications: Minimum Qualifications: 4. Currently has, or is in the process of obtaining a PhD degree in the domain of computer-vision, computer graphics, 3D machine perception or deep learning 5. Knowledge in deep learning, computer vision, graphics, generative modeling, LLMs and VLMs 6. Hands-on experience with implementing deep learning algorithms, large-scale training, benchmark and evaluation 7. Experience working within Python environments such as pytorch 8. Experience working in a Unix environment 9. Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment Preferred Qualifications: Preferred Qualifications: 10. Preference for 24 week full time internship 11. Intent to return to a degree-program after the completion of the internship 12. Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as first-authored publications at top tier conferences such as CVPR, ECCV, ICCV, SIGGRAPH, ICLR and NeurIPS 13. Strong track-record of published research in the fields of LLMs, VLMs, video generation, world modeling, VLA, human motion modeling, policy learning, generative modeling etc 14. Strong programming experience using python and pytorch 15. Demonstrated software engineer experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g. GitHub) 16. Experience working and communicating cross functionally in a team environment Public Compensation: $7,650/month to $12,134/month + benefits **Industry:** Internet Equal Opportunity: Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment. Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected]. 
 
 
- 
        
Recent Jobs
- 
                
                    Research Scientist Intern, Multimodal Generative AI and Robotics (PhD)
                
                - Meta (Redmond, WA)