Fundamental AI Research Scientist, Multimodal Audio (Speech, Sound and Music) - FAIR
Menlo Park, CA | Seattle, WA | Boston, MA | New York, NY
Meta
Giving people the power to build community and bring the world closer together
Meta is seeking Research Scientists to join its Fundamental AI Research (FAIR) organization, focused on making significant advances in AI. We publish groundbreaking papers and release frameworks/libraries that are widely used in the open-source community. The team conducts industry-leading research on building foundation models for audio understanding and audio generation. We also work closely with vision research teams to push the frontier of multimodal (audio, video, language) research.
Our team's research focuses on audio and multimodality. Individuals in this role are expected to be recognized experts in identified research areas such as artificial intelligence, speech and audio generation, and audio-visual learning. Researchers will drive impact by: (1) publishing state-of-the-art research papers, (2) open-sourcing high-quality code and reproducible results for the community, and (3) bringing the latest research to Meta products that connect billions of users. They will work with an interdisciplinary team of scientists, engineers, and cross-functional partners, and will have access to cutting-edge technology, resources, and research facilities.
$147,000/year to $208,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity
Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.
Fundamental AI Research Scientist, Multimodal Audio (Speech, Sound and Music) - FAIR Responsibilities
- Develop algorithms based on state-of-the-art machine learning and neural network methodologies.
- Perform research to advance the science and technology of intelligent machines.
- Conduct research that enables learning the semantics of data across multiple modalities (audio, speech, images, video, text, and other modalities).
- Work towards long-term ambitious research goals, while identifying intermediate milestones.
- Design and implement models and algorithms.
- Work with large datasets; train, tune, and scale models; create benchmarks to evaluate performance; and open-source and publish the results.
Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
- PhD degree in AI, computer science, data science, or related technical fields, or equivalent practical experience.
- 2+ years of experience holding an industry, faculty, academic, or government researcher position.
- Research publications reflecting experience in related research fields: audio (speech, sound, or music) generation, text-to-speech (TTS) synthesis, text-to-music generation, text-to-sound generation, speech recognition, speech / audio representation learning, vision perception, image / video generation, video-to-audio generation, audio-visual learning, audio language models, lip sync, lip movement generation / correction, lip reading, etc.
- Familiarity with one or more deep learning frameworks (e.g., PyTorch, TensorFlow, …).
- Experience with the Python programming language.
- Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.
- First-authored publications at peer-reviewed conferences, such as ICML, NeurIPS, ICLR, ICASSP, Interspeech, ACL, EMNLP, CVPR, and other similar venues.
- Research and engineering experience demonstrated via publications, grants, fellowships, patents, internships, work experience, open source code, and / or coding competitions.
- Experience solving complex problems and comparing alternative solutions, trade-offs, and diverse points of view.
- Experience working and communicating cross-functionally in a team environment.
- Experience communicating research findings to public audiences of peers.
Categories:
Data Science Jobs
Deep Learning Jobs
Research Jobs
Tags: ASR Computer Science Deep Learning EMNLP Engineering ICLR ICML Industrial Machine Learning NeurIPS NLP Open Source PhD Physics Python PyTorch Research TensorFlow VR
Perks/benefits: Career development Conferences Equity / stock options Health care Salary bonus
Region:
North America
Country:
United States