Research Scientist, Multimodal Assistive Agents (Astra) - Tokyo

Tokyo, Japan

Full Time USD 64K - 119K *

DeepMind

Posted 1 week ago

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Snapshot

We are seeking a highly motivated and innovative Research Scientist to join our team in Tokyo, focused on advancing the state-of-the-art in multimodal assistive agents (Astra).

https://deepmind.google/technologies/project-astra/

You will conduct cutting-edge research on large language models (LLMs), particularly in the multimodal domain (vision, audio, text), focusing on the development of more capable, robust, and helpful assistive agents, with a direct path to impacting millions of users through Google products. This role offers a unique opportunity to contribute to fundamental research in AI while collaborating with a world-class team at Google DeepMind and seeing your work translated into real-world applications. If you are passionate about shaping the future of human-computer interaction through AI and are eager to make a significant impact in the rapidly evolving landscape of assistive technologies, we encourage you to apply.

About us

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

The role

As a Research Scientist at Google DeepMind, you will:

Key responsibilities

Conduct cutting-edge research: Develop novel algorithms, models, and techniques for multimodal assistive agents, pushing the boundaries of AI research in areas such as natural language understanding, computer vision, speech processing, and reinforcement learning. Your research will directly contribute to advancing Google’s assistive agent capabilities in products.
Develop and evaluate models: Design, implement, and evaluate multimodal assistive AI agents. Explore techniques such as prompt engineering, few-shot learning, and post-training to improve model performance and robustness in diverse real-world scenarios. Your research will focus on building assistive agents that can perceive, reason, plan, and interact with humans in more natural and intuitive ways, ultimately shaping the user experience in Google products.
Collaborate with a world-class team: Work closely with other research scientists, engineers, and product teams across Google DeepMind, fostering a collaborative and intellectually stimulating environment. Share your research findings through publications in top-tier conferences and journals, while also contributing to the development of impactful products.
Contribute to real-world impact: See your research contribute to the development of next-generation multimodal assistive agents with applications across various domains, including education, healthcare, gaming, accessibility, and more, directly influencing the future of Google products and services.
Stay at the forefront of AI research: Continuously explore emerging trends and new research directions in multimodal AI. Participate in international conferences and workshops to share your work and learn from others in the field, bringing these cutting-edge advancements to Google’s product landscape.

About you

You are a passionate and talented researcher with a strong theoretical foundation and a proven ability to conduct impactful research in AI. You have a collaborative mindset and are excited to work as part of a team to tackle ambitious research challenges. You want to see your research translated into real-world products that improve the lives of users, in an environment where research has a direct path to product impact, and you are driven by a desire to create positive change through AI.

Ph.D. in Computer Science, Artificial Intelligence, or a related field.
Strong publication record in top-tier machine learning conferences or journals.
Solid understanding of deep learning, natural language processing, computer vision, and/or speech processing.
Experience with relevant ML frameworks such as JAX, TensorFlow, or PyTorch.

In addition, the following would be an advantage:

Experience with multimodal learning, large language models, and/or assistive AI agents.
Experience with prompt engineering, few-shot learning, post-training techniques, and evaluations.
Familiarity with large-scale model training and deployment.
Strong programming skills in Python or similar languages.
Excellent communication and collaboration skills.
Japanese language skills are a plus.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Categories: Data Science Jobs, Research Jobs

Tags: Computer Science, Computer Vision, Deep Learning, Engineering, JAX, LLMs, Machine Learning, Model training, NLP, Prompt engineering, Python, PyTorch, Reinforcement Learning, Research, TensorFlow