-
AI Research Scientist (Technical Leadership), Data…
- Meta (Menlo Park, CA)
-
Summary:
Meta is seeking research scientists to help us build the data foundation for Meta's most advanced Large Language and Media Models. We're looking for researchers with LLM expertise to join us on working with data at scale and to push beyond the data ceiling.Our team contributes to data curation across all stages of LLM development (pre-training, mid-training, post-training) and all domains/modalities (e.g., web, code, image, video, multilingual). We tackle the hardest challenges at trillion-scale, including organic data curation, synthetic data generation, agent and interaction data, and frontier paradigms that redefine what's possible. Based in Meta Superintelligence Labs (MSL) within the Fundamental AI Research Organization (FAIR), you'll directly contribute to Meta’s frontier models like Llama, while having the chance to collaborate with researchers and engineers across MSL.
Required Skills:
AI Research Scientist (Technical Leadership), Data Research - MSL FAIR Responsibilities:
1. Collaborate with cross-functional teams to develop Meta’s next foundational models
2. Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
3. Architect efficient and scalable data curation systems and pipelines
4. Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
5. Execute on high priority projects in pre-training, mid-training, or post-training data curation
6. Apply specialized expertise in video/image generation, video/image perception, OCR, agentic data, synthetic data, reasoning data, web parser, coding data, data scaling laws, or datamix optimization
7. Lead complex technical projects end-to-end
Minimum Qualifications:
Minimum Qualifications:
8. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
9. PhD in Computer Science or a related technical field
10. 4+ years of industry research experience in NLP or CV
11. 4+ years as a formal technical lead experience
12. Experience leading major technical initiatives with cross-functional impact and influencing strategy across multiple teams
13. Practical experience with multimodal pre-training or mid-training data curation for large language models, media perception, or media generation models
14. Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Preferred Qualifications:
Preferred Qualifications:
15. Experience working on frontier-quality/ state-of-the-art Large Language or Large Media Models
16. First-author publications at top peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV)
17. Programming experience in Python and hands-on experience with frameworks like PyTorch or Spark, or related distributed computing frameworks (Ray, DataFlow)
18. Hands-on experience on SQL and large-scale data handling, with familiarity of frameworks like Spark and Hive
Public Compensation:
$213,000/year to $293,000/year + bonus + equity + benefits
**Industry:** Internet
Equal Opportunity:
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.
Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].
-
Recent Searches
- Per Diem Speech Language (Queens County, NY)
- Geotechnical Engineer 7 Geo (Columbus, OH)
- Director Biz Experience Data (Washington)
- Quantitative Analytics Model Consultant (Charlotte, NC)
Recent Jobs
-
AI Research Scientist (Technical Leadership), Data Research - MSL Fair
- Meta (Menlo Park, CA)
-
Collection Development and Licensing Coordinator (Research Assistant Librarian) for the Marriott Library
- University of Utah (Salt Lake City, UT)