Research Scientist, Speech

Berlin

Cantina

A new social platform where you can create, share, and interact with Al bots live with friends.

View all jobs at Cantina

Apply now Apply later

A bit about Cantina:

Cantina, founded by Sean Parker, is a new social platform with the most advanced AI character creator. Build, share, and interact with AI bots and your friends directly in the Cantina or across the internet.

Cantina bots are lifelike, social creatures, capable of interacting wherever humans go on the internet. Recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters. Bots are a new media type that offer a way for creators to share infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

A bit about the role: 

We are seeking talented Research Scientists to join our team, focused on advancing the capabilities of our AI-driven social platform. As a Research Scientist, you will play a pivotal role in developing state-of-the-art speech models that enable our AI bots to perceive and interact with audio and speech in real-time.

As a Research Scientist, you will:

  • Conduct research to develop novel and scalable algorithms and models for speech generation, voice cloning, voice conversion, and voice generation. 

  • Focus on optimization and efficiency of novel large-scale model architectures to enable real-time interactions within the product

  • Collaborate with product, design and engineering teams to develop research prototypes solving particular product problems.

  • Explore and analyze cutting-edge research,survey, evaluate, and integrate new techniques quickly.

  • Work on LLMs, GANs, Diffusion and Flow matching models.

  • Publish papers and open-source research breakthroughs within the community

A bit about you:

  • Ph.D. or equivalent experience in Computer Science, Electrical Engineering, or related fields with a focus on generative modeling, including large language modeling, speech recognition, text-to-speech or computer vision

  • Proven track record demonstrated through publications at top-tier conferences, or journals, open-source project contributions, and patents. 

  • Excellent hands-on experience with training large-scale generative LLMs and diffusion models and GANs. 

  • Proficiency in signal processing and speech modeling techniques.

  • Hands-on experience with large-scale datasets, data augmentation, and self-supervised learning for speech tasks.

  • Strong programming skills in Python with deep learning frameworks (PyTorch, JAX).

  • Prior experience in deploying research to real-world applications

  • Ability to work independently and collaboratively in a fast-paced, dynamic environment, with a strong sense of ownership and drive to deliver impactful results.

Why Join Cantina AI?

  • Opportunity to work on groundbreaking AI technologies that redefine social-media and human-computer interaction.

  • A collaborative and inclusive culture that values innovation, continuous learning, and professional growth.

  • Competitive compensation package including equity options, healthcare benefits, a generous wellness stipend and flexible work arrangements.

Please note:

  • This is a remote role from anywhere in  the GMT time zone.

  • The salary starts at $200-275k + stock option plan.

  • This is full-time employment only - no contractors possible - usually through Remote.com.

  • We are unable to sponsor location based visas.

Application Process: Please submit your resume, cover letter, and any relevant portfolio or publications demonstrating your research contributions in AI.

Apply now Apply later
Job stats:  2  0  0

Tags: Architecture ASR Computer Science Computer Vision Deep Learning Diffusion models Engineering GANs Generative modeling JAX LLMs NLP Open Source Python PyTorch Research

Perks/benefits: Career development Competitive pay Conferences Equity / stock options Wellness

Regions: Remote/Anywhere Europe
Country: Germany

More jobs like this