Senior Multimodal Researcher- Spatial Audio AI
Atlanta, US
Full Time Senior-level / Expert USD 118K - 163K
Dolby Laboratories
Dolby entwickelt Audio-, Bild- und Sprachtechnologien für Film, TV, Musik und Spiele. Erleben Sie alles mit beeindruckendem Klang und atemberaubendem Bild
Join the leader in entertainment innovation and help us design the future. At Dolby, science meets art, and high tech means more than computer code. As a member of the Dolby team, you’ll see and hear the results of your work everywhere, from movie theaters to smartphones. We continue to revolutionize how people create, deliver, and enjoy entertainment worldwide. To do that, we need the absolute best talent. We’re big enough to give you all the resources you need, and small enough so you can make a real difference and earn recognition for your work. We offer a collegial culture, challenging projects, and excellent compensation and benefits, not to mention a Flex Work approach that is truly flexible to support where, when, and how you do your best work.
The Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is to look ahead, deliver insights, and innovate technological solutions that will fuel Dolby’s continued growth. Our researchers have a broad range of expertise related to computer science and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering, image processing, computer vision, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.
We are seeking talented Senior Multimodal Researchers to join the Perceptual and Interactive Multimedia Computing team in the Multimodal Experiences Lab.
We are a key research team within Dolby’s Advanced Technology Group, focused on creating cutting edge multimodal technologies that drive next generation experiences. We’re looking for skilled researchers who are excited to advance the state of the art in technologies of interest to Dolby as well as the human society at large, in particular, in the area of developing AI solutions for Spatial Media/XR audio content creation workflow.
We welcome the opportunity to have you join our growing Atlanta Advanced Technology Research team.
Key Responsibilities:
- Develop AI models for spatial audio content creation and audio engineering workflow
- Develop multimodal foundation models for XR, with audio focus.
- Develop and combine deep learning methodologies with perceptually relevant signal processing and metrics.
- Partner with ATG researchers, develop solutions for the relevant applications.
What you need to succeed
Competencies:
- Technical depth: Necessary technical knowledge to create new AI algorithms and multimodal models with an audio focus. Solid knowledge of Audio, ML and AI fundamentals.
- Explore new technologies: Openness to learn new skills, work with cutting-edge technologies, and innovate in new areas.
- Invent & Innovate: Develop know-how, algorithms and software tools with both a short and long-term focus that further strengthen Dolby as a world leader for sight and sound experiences associated with digital content consumption. Then influence and collaborate with business group partners putting the technology into production.
- Work with a sense of Urgency: Respond aggressively to changing trends and new technologies and creates new algorithms to capitalize on them. Take appropriate risks to be ahead of the competition and the market.
- Collaborate: Collaborate with and influence peers in developing industry-leading technologies. Work with external trendsetters and technology drivers in academia and in partner enterprises.
Desired Background:
- PhD in Computer Science, Electrical and Computer Engineering, or similar fields
- Proven ability to pursue new areas of multimodal research for Audio, AI, and signal analysis, and demonstrate results through projects, prototypes, patent filings, and papers in peer reviewed journals and conferences
- High comfort level in creating Algorithms in Python
- Solid knowledge on audio signal analysis, spatial analysis, creation, and generation
- Solid knowledge on AI/ML, e.g. large language model and generative AI
- Familiarity with deep learning frameworks, e.g., TensorFlow, PyTorch, etc..
- Excellent problem-solving and partnership skills
- Excellent communication and presentation skills
The Atlanta Area base salary range for this full-time position is $118,700-$163,300, which can vary if outside this location, plus bonus, benefits, and some roles may also include equity. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, competencies, experience, market demands, internal parity, and relevant education or training. Your recruiter can share more about the specific salary range and perks and benefits for your location during the hiring process.
Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12
Equal Employment Opportunity:
Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race, religious creed, color, age, sex, sexual orientation, gender identity, national origin, religion, marital status, family status, medical condition, disability, military service, pregnancy, childbirth and related medical conditions or any other classification protected by federal, state, and local laws and ordinances.
Tags: Classification Computer Science Computer Vision Content creation Deep Learning Distributed Systems Engineering Generative AI LLMs Machine Learning PhD Python PyTorch Research TensorFlow
Perks/benefits: Career development Conferences Equity / stock options Salary bonus Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.