Research Engineer, Multimodal (All Industry Levels)

Menlo Park or New York City

Character.AI

Chat with millions of AI Characters anytime, anywhere. Super-intelligent chat bots that hear you, understand you, and remember you. Free to use with no ads.

View all jobs at Character.AI

Apply now Apply later

About the role

We’re looking for scrappy and self-motivated people who have full-stack machine learning skills: collecting data, training state-of-the-art models, building evaluations, writing efficient inference algorithms, and iterating on user feedback.

In the day-to-day, you will be responsible for developing new multimodal capabilities end-to-end. This means you will need to wear a lot of hats across the full ML stack. You should be comfortable thinking about all parts of the problem, and ready to work on any and all components of it.

Responsibilities

  • Determining the type of training data we need, finding where we can collect it, and writing distributed data gathering pipelines to ingest data.

  • Developing new model architectures that push the state-of-the-art in terms of quality, scale, and inference speed.

  • Creating new evaluations that capture different aspects of generative outputs

  • Writing fast inference algorithms to serve these models at scale.

  • Working with product teams to integrate feedback mechanisms into the product, which we use to improve the model.

  • Working with large scale image/audio datasets.

Requirements

  • "All Industry Levels": at least PhD (or equivalent) research experience.

  • Experiences in diffusion modeling, large scale image/video data processing, and transformer model training.

  • Strong engineering skills in deep learning frameworks of PyTorch.

  • Track record of releases, publications, and/or open source projects relating to image / video generation, diffusion modeling, transformers, and so on.

  • Have a deep understanding of the “whole stack” and track record of successfully owning projects from start to finish when it comes to designing, training, evaluating and deploying machine learning models, especially large language models.

About Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.


In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.


Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

Apply now Apply later
Job stats:  0  0  0

Tags: Architecture Deep Learning Diffusion models Engineering LLMs Machine Learning ML models Model training Open Source PhD Pipelines PyTorch Research Transformers

Region: North America
Country: United States

More jobs like this