-
AI Research Scientist, Computer Vision - Facebook…
- Meta (Menlo Park, CA)
-
Summary:
The Video Intelligence team is an applied AI research team within the Facebook pillar. This role is expected to develop advanced video generation and understanding foundation models, enabling innovative AI-driven video creation experiences and enhancing our ability to comprehend video content. The team is responsible for building State-of-the-art GenAI technology to empower video generation and understanding.
Required Skills:
AI Research Scientist, Computer Vision - Facebook Video Intelligence Responsibilities:
1. Build a variety of multimodal foundation models such as text-to-video generative models, image-to-video generative models, video understanding models, unified native video generative models
2. Design core foundation model architectures and progressive pre-train
3. Post-train foundation models using techniques such as Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), Direct Preference Optimization (DPO), and Low-Rank Adaptation (LoRA)
4. Conduct research to develop SOTA GenAI models for the Facebook family of apps
5. Collaborate with colleagues from the infrastructure and product teams on launching models
Minimum Qualifications:
Minimum Qualifications:
6. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
7. PhD in Computer Science, Machine Learning, or a relevant technical field
8. 1+ year of industry experience training multimodal, computer vision, LLM or related AI/ML models
9. Experience owning and/or driving complex technical projects from end-to-end
10. Publications at peer-reviewed conferences (e.g. ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL)
11. Programming experience in Python and hands-on experience with frameworks such as PyTorch
12. Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Preferred Qualifications:
Preferred Qualifications:
13. First-authored publications at peer-reviewed conferences (e.g. ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL)
14. Experience collaborating in cross-functional teams, including product, engineering, and research
15. Experience building text-to-video generative models, image-to-video generative models, video understanding models, and/or unified native video generative models
Public Compensation:
$147,000/year to $208,000/year + bonus + equity + benefits
**Industry:** Internet
Equal Opportunity:
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.
Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].
-
Recent Jobs
-
AI Research Scientist, Computer Vision - Facebook Video Intelligence
- Meta (Menlo Park, CA)