Research Scientist Manager, Generative AI - Llama Pre-training
Menlo Park, CA | New York, NY
Meta
Giving people the power to build community and bring the world closer together
The Generative AI organization at Meta is seeking a Research Scientist Manager to join our team and work on the next generation of large language models, particularly focusing on pre-training and mid-training data. As a Research Scientist Manager, you will play a critical role in building our series of Llama models and setting direction from the earliest stages, with a focus on data including curation, ablations, scaling laws, and defining new tasks (e.g. reasoning, coding). Data is the single largest lever we have to improving our models.Research Scientist Manager, Generative AI - Llama Pre-training Responsibilities
$177,000/year to $251,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.
- Drive end-to-end development of large language models, including data sourcing and curation, filtering, experiment design, evaluation and more
- Drive efficiency gains on training and deployment of LLMs through novel techniques
- Lead a team of applied researchers to democratize Llama for Meta's users
- Communicate, collaborate, and build relationships with clients and peer teams to facilitate cross-functional projects
- Remain up-to-date on ongoing research and software development activities in the team, help work through technical challenges, and be involved in design decisions
- Remain involved in the research community, both understanding trends, and setting them
- 5+ years of hands-on experience in large language model, NLP, and Transformer modeling, in the setting of both research and engineering development
- Experience and track record of landing large research and/or product impacts in a fast-paced environment
- 3+ years of hands-on supporting and leading teams of research scientists and software engineers
- Proven technical vision in where the field of generative AI will go
- Experience of and knowledge of data curation techniques (training set preparation, ablation experiments, etc.)
- Experience with cross functional collaboration with other teams including non-engineering functions
- Demonstrated experience recruiting, building, structuring, leading technical organizations, including performance management
- PhD in deep learning, artificial intelligence, and/or related technical field
- Experience and knowledge of ML frameworks like PyTorch, TensorFlow, etc.
- Experience and knowledge of large-scale data platforms such as Spark, Hive, etc.
- Experience and knowledge of working with LLM frameworks like LangChain
- Experience and knowledge of training LLMs, fine-tuning on datasets, especially Llama
$177,000/year to $251,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.
Job stats:
0
0
0
Tags: Deep Learning Engineering Generative AI LangChain LLaMA LLMs Machine Learning NLP PhD Physics PyTorch Research Spark TensorFlow VR
Perks/benefits: Career development Equity / stock options Health care Salary bonus Team events
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
BI Developer jobsData Engineer II jobsSr. Data Engineer jobsPrincipal Data Engineer jobsStaff Data Scientist jobsBusiness Intelligence Analyst jobsStaff Machine Learning Engineer jobsData Science Manager jobsPrincipal Software Engineer jobsData Manager jobsData Science Intern jobsJunior Data Analyst jobsSoftware Engineer II jobsDevOps Engineer jobsData Analyst Intern jobsData Specialist jobsBusiness Data Analyst jobsSr. Data Scientist jobsStaff Software Engineer jobsLead Data Analyst jobsAI/ML Engineer jobsResearch Scientist jobsSenior Backend Engineer jobsData Engineer III jobsBI Analyst jobs
NLP jobsAirflow jobsOpen Source jobsEconomics jobsMLOps jobsTerraform jobsKPIs jobsNoSQL jobsKafka jobsLinux jobsJavaScript jobsComputer Vision jobsData Warehousing jobsRDBMS jobsGoogle Cloud jobsPostgreSQL jobsPhysics jobsBanking jobsGitHub jobsScikit-learn jobsHadoop jobsScala jobsStreaming jobsData warehouse jobsPandas jobs
R&D jobsOracle jobsdbt jobsCX jobsBigQuery jobsClassification jobsLooker jobsReact jobsDistributed Systems jobsPySpark jobsScrum jobsRAG jobsRedshift jobsJira jobsELT jobsRobotics jobsPrompt engineering jobsMicroservices jobsIndustrial jobsGPT jobsSAS jobsMySQL jobsData Mining jobsNumPy jobsTypeScript jobs