Postdoctoral Researcher, AI Infrastructure (PhD)
Menlo Park, CA
Meta
Giving people the power to build community and bring the world closer together
The Model Quality Enablement (MQE) team is a group of Machine Learning (ML) experts within AI Infra - Trainers organization. Our unique strength is applying ML knowledge in the Infra domain to resolve challenges that affect model quality. Model Quality, commonly measured in Normalized Entropy (NE) at Meta, is one of the most important metrics for model training. It reflects the business impact a model can achieve. As such, it is critical to preventing model quality regression, which is common in enabling new training techniques or imposing resource constraints (e.g., training the same model with fewer GPUs).
The MQE team works on a wide range of models in various projects, for instance:
Build scalable solutions to root-cause, mitigate and prevent model quality regressions for various recommendation and ranking models..
Enable end2end training of the new Sequential-based RecSys models and close the model quality gap between training <> inference models.
Conduct applied research on developing novel training algorithms (e.g., Parameter Efficient Fine Tuning of LLMs) that maintain or improve model quality with various resource constraints.
Our methodology is to develop fundamental model quality guardrails and optimization methods that are model type and training framework agnostic, rooting from a deep understanding of large scale model training/optimization process. The team collaborates closely with product groups including Ads (ML Infra and Core Ranking) and GenAI (MetaAI Search Engine). We also partner with other lower-level infra teams such as PyTorch and Performance & Capacity.
The team offers great learning and growth opportunities for Infra/System generalists who want to develop ML skills, and vice versa for ML/AI generalists who want to learn about training infra and distributed systems.
We are looking for PostDoc candidates who are interested in applied research on:
You will have the opportunity to solve critical business problems at Meta by applying SotA techniques in the domain of LLMs, representation learning, RecSys and more. Publications to top Machine Learning conferences are encouraged and will be sponsored. This role will be a great entry point to experience doing applied research at an industry-leading company that created LLaMa.Postdoctoral Researcher, AI Infrastructure (PhD) Responsibilities
$117,000/year to $173,000/year + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity and Affirmative Action Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.
The MQE team works on a wide range of models in various projects, for instance:
Build scalable solutions to root-cause, mitigate and prevent model quality regressions for various recommendation and ranking models..
Enable end2end training of the new Sequential-based RecSys models and close the model quality gap between training <> inference models.
Conduct applied research on developing novel training algorithms (e.g., Parameter Efficient Fine Tuning of LLMs) that maintain or improve model quality with various resource constraints.
Our methodology is to develop fundamental model quality guardrails and optimization methods that are model type and training framework agnostic, rooting from a deep understanding of large scale model training/optimization process. The team collaborates closely with product groups including Ads (ML Infra and Core Ranking) and GenAI (MetaAI Search Engine). We also partner with other lower-level infra teams such as PyTorch and Performance & Capacity.
The team offers great learning and growth opportunities for Infra/System generalists who want to develop ML skills, and vice versa for ML/AI generalists who want to learn about training infra and distributed systems.
We are looking for PostDoc candidates who are interested in applied research on:
You will have the opportunity to solve critical business problems at Meta by applying SotA techniques in the domain of LLMs, representation learning, RecSys and more. Publications to top Machine Learning conferences are encouraged and will be sponsored. This role will be a great entry point to experience doing applied research at an industry-leading company that created LLaMa.Postdoctoral Researcher, AI Infrastructure (PhD) Responsibilities
- The application of LLMs in the domain of large-scale model training and productivity.
- The application of LLMs in Recommendation Systems.
- Novel training techniques for LLMs and Recommendation Systems with resource constraints.
- Currently has or is in the process of obtaining a PhD degree or completing a postdoctoral assignment in Computer Science or a similar field. Degree must be completed prior to joining Meta.
- Research experience in NLP, Large Language Models, RecSys.
- Experience with PyTorch and distributed machine learning.
- Programming experience using Python and PyTorch.
- Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta.
- Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment.
- Demonstrated research and software engineering experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g. GitHub).
- Experience solving complex problems and comparing alternative solutions, tradeoffs, and diverse points of view to determine a path forward.
- Experience working and communicating cross functionally in a team environment.
$117,000/year to $173,000/year + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity and Affirmative Action Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.
Job stats:
4
1
0
Tags: Computer Science Distributed Systems Engineering Generative AI GitHub LLaMA LLMs Machine Learning ML infrastructure Model training NLP Open Source PhD Physics Postdoc Python PyTorch Research VR
Perks/benefits: Career development Conferences Equity / stock options Health care Salary bonus
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Staff Machine Learning Engineer jobsData Engineer II jobsStaff Data Scientist jobsBI Developer jobsData Scientist II jobsPrincipal Data Engineer jobsData Manager jobsJunior Data Analyst jobsData Science Manager jobsSenior AI Engineer jobsResearch Scientist jobsBusiness Data Analyst jobsData Specialist jobsPrincipal Software Engineer jobsData Science Intern jobsLead Data Analyst jobsData Analyst Intern jobsData Analyst II jobsSr. Data Scientist jobsBI Analyst jobsData Engineer III jobsSoftware Engineer II jobsSoftware Engineer, Machine Learning jobsAzure Data Engineer jobsJunior Data Engineer jobs
Snowflake jobsLinux jobsEconomics jobsOpen Source jobsComputer Vision jobsRDBMS jobsData Warehousing jobsHadoop jobsGoogle Cloud jobsMLOps jobsAirflow jobsKafka jobsNoSQL jobsClassification jobsBanking jobsJavaScript jobsScala jobsKPIs jobsData warehouse jobsStreaming jobsScikit-learn jobsPhysics jobsLooker jobsOracle jobsR&D jobs
PostgreSQL jobsPySpark jobsTerraform jobsGitHub jobsCX jobsRobotics jobsPandas jobsSAS jobsBigQuery jobsScrum jobsData Mining jobsIndustrial jobsDistributed Systems jobsRedshift jobsJira jobsUnstructured data jobsPharma jobsMicroservices jobsdbt jobsJenkins jobsGPT jobsRAG jobsE-commerce jobsReact jobsMySQL jobs