Foundational AI Research Intern (PhD, Fall 2025)

Atlanta, US

Dolby Laboratories

Dolby develops audio, imaging, and voice technologies for film, TV, music, and games. Experience everything with impressive sound and stunning picture.


 

Join the leader in entertainment innovation and help us design the future. At Dolby, science meets art, and high tech means more than computer code. As a member of the Dolby team, you’ll see and hear the results of your work everywhere, from movie theaters to smartphones. We continue to revolutionize how people create, deliver, and enjoy entertainment worldwide. To do that, we need the absolute best talent. We’re big enough to give you all the resources you need, and small enough so you can make a real difference and earn recognition for your work. We offer a collegial culture, challenging projects, and excellent compensation and benefits, not to mention a Flex Work approach that is truly flexible to support where, when, and how you do your best work.

 

The Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is to look ahead, deliver insights, and innovate technological solutions that will fuel Dolby’s continued growth. Our researchers have a broad range of expertise related to computer science and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering, image processing, computer vision, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.

 

PhD Research Intern – Foundational AI

Join the world leader in innovation and in building unique entertainment experiences. The Advanced Technology Group (ATG) imagines, creates, and integrates cutting-edge technologies at Dolby and is central to the company's innovation. As a Research Intern at ATG, you will imagine new visual, audio, and multimodal experiences that enable content creators to deliver their stories with maximum impact while allowing consumers to enjoy these experiences with unprecedented quality and immersion.

The Advanced Technology Group (ATG) at Dolby is at the forefront of research and development in audio-visual technologies. Our team explores novel approaches in spatial audio processing, high dynamic range (HDR) imaging, computer vision, machine learning, and perceptual modeling. We collaborate across disciplines to push the boundaries of what is possible in entertainment technology, translating cutting-edge research into innovations that shape the future of media experiences worldwide.

About the role

While work on foundation models has traditionally centered on standard audio and visual technologies, Dolby is pioneering the application of these powerful AI systems to enhance media experiences. We are seeking a PhD research intern to join our Foundational AI lab to explore how large-scale models can be leveraged for generation and representation learning in spatial audio, HDR imaging/video, and high pixel depth imaging/video.

As a Research Intern, you will:

  • Design and implement novel neural architectures for processing enhanced media formats.
  • Develop training methodologies for foundation models that can understand and generate high-fidelity audio-visual content.
  • Research techniques for efficient fine-tuning of large models for Dolby-specific applications.
  • Collaborate with cross-functional teams to integrate research findings into Dolby's product ecosystem.
  • Present research findings to internal stakeholders and potentially at academic conferences.

The role will be based out of our research facility in Atlanta, GA, and offers the opportunity to work with state-of-the-art computing resources and proprietary datasets.

Requirements

  • Currently enrolled in a PhD program in Computer Science, Electrical Engineering, Machine Learning, Computational Media, or related fields.
  • Strong background in imaging/video/audio modeling and understanding.
  • Demonstrable proficiency in training and fine-tuning large models (diffusion models, transformers, autoregressive models, etc.).
  • Solid understanding of deep learning fundamentals and experience with frameworks such as PyTorch.
  • Excellent programming skills in Python.
  • Ability to work independently and as part of a collaborative research team.

Desirable Experience

  • First-author publications in relevant domains at top venues such as CVPR, ICCV, NeurIPS, ICLR, ICML, and ICASSP.
  • Experience scaling up model training across multiple GPUs on hybrid infrastructure.
  • Familiarity with audio processing, computer vision, or multimedia systems.
  • Knowledge of perceptual quality metrics for audio and visual media.
  • Prior work with HDR imaging or spatial audio technologies.

Application Process

We will review applications on a rolling basis. Qualified candidates should submit a CV, research statement, and relevant publications or project examples.

Join us in shaping the future of entertainment technology through the power of AI and foundation models.

Eligibility

Currently enrolled in a doctoral program. Recent graduates who are within 6 months of graduation are also eligible to apply. Must be available to work full-time, Monday through Friday, for 12 weeks between September 2025 and December 2025.

The start date for this internship is as follows (please note that this date is not flexible):

  • Monday, September 22, 2025

The Atlanta Area base salary range for this full-time position is $57/hr, which can vary if outside this location, plus bonus and benefits; some roles may also include equity. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, competencies, experience, market demands, internal parity, and relevant education or training. Your recruiter can share more about the specific salary range, perks, and benefits for your location during the hiring process.

 

Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12.

 

Equal Employment Opportunity:
Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race, religious creed, color, age, sex, sexual orientation, gender identity, national origin, religion, marital status, family status, medical condition, disability, military service, pregnancy, childbirth and related medical conditions or any other classification protected by federal, state, and local laws and ordinances.



Region: North America
Country: United States
