Research Scientist Manager, Generative AI - Llama Pre-training
Menlo Park, CA | New York, NY
Meta
Giving people the power to build community and bring the world closer together
The Generative AI organization at Meta is seeking a Research Scientist Manager to join our team and work on the next generation of large language models, particularly focusing on pre-training and mid-training data. As a Research Scientist Manager, you will play a critical role in building our series of Llama models and setting direction from the earliest stages, with a focus on data including curation, ablations, scaling laws, and defining new tasks (e.g. reasoning, coding). Data is the single largest lever we have to improving our models.Research Scientist Manager, Generative AI - Llama Pre-training Responsibilities
$177,000/year to $251,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.
- Drive end-to-end development of large language models, including data sourcing and curation, filtering, experiment design, evaluation and more
- Drive efficiency gains on training and deployment of LLMs through novel techniques
- Lead a team of applied researchers to democratize Llama for Meta's users
- Communicate, collaborate, and build relationships with clients and peer teams to facilitate cross-functional projects
- Remain up-to-date on ongoing research and software development activities in the team, help work through technical challenges, and be involved in design decisions
- Remain involved in the research community, both understanding trends, and setting them
- 5+ years of hands-on experience in large language model, NLP, and Transformer modeling, in the setting of both research and engineering development
- Experience and track record of landing large research and/or product impacts in a fast-paced environment
- 3+ years of hands-on supporting and leading teams of research scientists and software engineers
- Proven technical vision in where the field of generative AI will go
- Experience of and knowledge of data curation techniques (training set preparation, ablation experiments, etc.)
- Experience with cross functional collaboration with other teams including non-engineering functions
- Demonstrated experience recruiting, building, structuring, leading technical organizations, including performance management
- PhD in deep learning, artificial intelligence, and/or related technical field
- Experience and knowledge of ML frameworks like PyTorch, TensorFlow, etc.
- Experience and knowledge of large-scale data platforms such as Spark, Hive, etc.
- Experience and knowledge of working with LLM frameworks like LangChain
- Experience and knowledge of training LLMs, fine-tuning on datasets, especially Llama
$177,000/year to $251,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.
Job stats:
4
0
0
Tags: Deep Learning Engineering Generative AI LangChain LLaMA LLMs Machine Learning NLP PhD Physics PyTorch Research Spark TensorFlow VR
Perks/benefits: Career development Equity / stock options Health care Salary bonus Team events
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
BI Developer jobsData Engineer II jobsStaff Data Scientist jobsPrincipal Data Engineer jobsSr. Data Engineer jobsStaff Machine Learning Engineer jobsPrincipal Software Engineer jobsData Science Manager jobsData Manager jobsData Science Intern jobsSoftware Engineer II jobsDevOps Engineer jobsBusiness Intelligence Analyst jobsJunior Data Analyst jobsData Analyst Intern jobsData Specialist jobsBusiness Data Analyst jobsLead Data Analyst jobsStaff Software Engineer jobsSr. Data Scientist jobsSenior Backend Engineer jobsData Governance Analyst jobsAI/ML Engineer jobsData Engineer III jobsResearch Scientist jobs
Consulting jobsAirflow jobsMLOps jobsOpen Source jobsKPIs jobsKafka jobsJavaScript jobsEconomics jobsLinux jobsTerraform jobsNoSQL jobsData Warehousing jobsComputer Vision jobsGoogle Cloud jobsGitHub jobsRDBMS jobsPostgreSQL jobsR&D jobsScikit-learn jobsStreaming jobsPhysics jobsData warehouse jobsBanking jobsHadoop jobsdbt jobs
Scala jobsLooker jobsPandas jobsOracle jobsBigQuery jobsClassification jobsReact jobsRAG jobsCX jobsScrum jobsDistributed Systems jobsPySpark jobsIndustrial jobsPrompt engineering jobsELT jobsJira jobsMicroservices jobsRedshift jobsGPT jobsRobotics jobsTypeScript jobsOpenAI jobsLangChain jobsSAS jobsJenkins jobs