Research Scientist Intern, AI Research - Speech & Audio (PhD)

Menlo Park, CA | Seattle, WA | San Francisco, CA


The GenAI Speech team at Meta is currently looking for Research Scientist interns. Our team creates spoken language technology to make it faster and easier for people to build community and connect with others around the world. We conduct product-motivated research in ML/AI and design, develop and deploy state-of-the-art algorithms to the rest of Meta. We work on all aspects of AI for speech and audio processing, including speech recognition, speech synthesis, speaker identification, keyword spotting, noise robustness, multilingual systems, and speech with large language models (LLMs). Our work powers voice interactions on AR/VR devices, such as Ray-Ban | Meta smart glasses and Quest 3 mixed-reality headsets, and video content understanding, including captioning and understanding of videos on Facebook and Instagram. As a Research Scientist Intern, you will help us develop innovative models and algorithms and apply them to large-scale production speech tasks.


Our teams at Meta AI offer internships of twelve (12) to twenty-four (24) weeks, with various start dates throughout the year. Internships are available in the Bay Area, CA and Seattle, WA.

Research Scientist Intern, AI Research - Speech & Audio (PhD) Responsibilities
  • Perform research to advance the science and technology of intelligent machines.
  • Develop novel and accurate speech algorithms and systems, leveraging deep learning and machine learning on big data resources.
  • Contribute research that can be applied to Meta product development.
  • Analyze and improve efficiency, scalability, and stability of various deployed systems.
  • Collaborate with team members from prototyping to production.
Minimum Qualifications
  • Currently has, or is in the process of obtaining, a PhD degree in Computer Science, Artificial Intelligence, Natural Language Processing, or a related field.
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment.
  • Experience in C/C++ and Python.
  • Experience in deep learning frameworks (PyTorch, TensorFlow, etc.).
  • Research and/or work experience in machine learning, deep learning, and/or speech technology.
Preferred Qualifications
  • Experience manipulating and analyzing complex, high-volume, high-dimensionality data from varying sources.
  • Proven track record of achieving results as demonstrated by grants, fellowships, patents, as well as first-authored publications at workshops or conferences such as Interspeech, ICASSP or similar.
  • A strong interest in theoretical and empirical research and in answering hard questions with research.
  • Interpersonal experience: cross-group and cross-culture collaboration.
  • Ability to stay current with the literature of a particular domain and to reproduce results if needed.
  • Experience training deep neural networks for key speech tasks such as speech recognition, speech translation, speech synthesis, speaker diarization, sentiment analysis, acoustic event recognition, wake word, scene understanding, etc.
  • Intent to return to a degree program after the completion of the internship/co-op.
For those who live in or expect to work from California if hired for this position, please click here for additional information.

About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.

$7,500/month to $11,333/month + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.

Tags: ASR Big Data Computer Science Deep Learning Generative AI LLMs Machine Learning NLP PhD Physics Prototyping Python PyTorch Research Speech synthesis TensorFlow VR

Perks/benefits: Career development Conferences Equity / stock options Health care Salary bonus

Region: North America
Country: United States
