-
ML Framework Software Engineer (PhD)
- Meta (Menlo Park, CA)
-
Summary:
This role is about developing the core PyTorch 2.0 technologies, innovating and advancing the state-of-the-art of ML compilers, and accelerating PT2 adoption through direct engagements with OSS and industry users.The PyTorch Compiler team is dedicated to making PyTorch run faster and more resource-efficient without sacrificing its flexibility and ease of use. The team is the driving force behind PT2, a step function change in PyTorch’s history that brought compiler technologies to the core of PyTorch. PT2 technologies have gained industry-wide recognition since their first release in March 2023. The team is committed to building the PT2 compiler that withstands the test of time while striving to become the #1 ML framework compiler in the industry. Our work is open source, cutting-edge, and industry leading.
Required Skills:
ML Framework Software Engineer (PhD) Responsibilities:
1. Develop the PT2 compiler (e.g., TorchDynamo, TorchInductor, PyTorch Distributed, PyTorch Core)
2. Improve PyTorch performance via systematic solutions for the entire community
3. Explore the intersection of the PyTorch compiler and PyTorch distributed
4. Optimize Generative AI models across the stack (pre-training, fine-tuning, and inference)
5. Collaborate with users of PyTorch to enable new use cases of PT2 technologies both inside and outside Meta
Minimum Qualifications:
Minimum Qualifications:
6. Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
7. Currently has or is in the process of obtaining a PhD degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
8. Research or industry experience in developing compilers, ML systems, ML accelerators, GPU performance, and similar
9. Advanced in Python or C++ programming
Preferred Qualifications:
Preferred Qualifications:
10. Experience in developing PyTorch/PT2, Triton, MLIR, JAX, XLA, TVM is a huge plus
11. Knowledge in GPU architecture, ML accelerator performance, and developing high-performance kernels
12. Experience in building OSS communities and extensive social media presence in the ML Sys domain
13. Experience with training models, end-to-end model optimizations, or applying ML to systems
14. Knowledge of communication collectives, PyTorch distributed, and parallelism
15. Experience in developing inside other ML frameworks like Caffe2, TensorFlow, ONNX, TensorRT
16. First-authored publications at peer-reviewed conferences (e.g. NeurIPS, MLSys, ASPLOS, PLDI, ICML, or similar)
Public Compensation:
$56.25/hour to $173,000/year + bonus + equity + benefits
**Industry:** Internet
Equal Opportunity:
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.
Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].
-