AGI Model Architect / Research Scientist in AGI Model Architecture

US-Washington-Bellevue, United States

Full Time, Senior-level / Expert, USD 141K - 265K

Tencent

Founded in November 1998, Tencent is an Internet company that uses technology to enrich the lives of Internet users and support the digital upgrade of enterprises. Our mission is "Value for Users, Tech for Good".

Posted 3 weeks ago

Business Unit

What the Role Entails

Job Overview:
We are committed to building the core architecture for Artificial General Intelligence (AGI) systems that match or surpass human-level capabilities. As a key contributor to our core R&D team, you will help develop large-scale models with multimodal perception, autonomous learning, and reasoning abilities, and drive their generalization to real-world applications. Our goal is to design a natively multimodal system that understands and generates across vision, speech, and text while interacting deeply with its environment, catalyzing the transition from AGI to ASI (Artificial Superintelligence).

Responsibilities:

Design unified large model architectures with integrated capabilities in multimodal perception, reasoning, memory, and generation (across vision/audio/text).
Build systems that support continual learning, hierarchical memory, autonomous exploration, and self-evolution.
Advance the development of agent-based systems with autonomous task planning, cross-modal interaction, tool usage, and self-improvement capabilities.
Contribute deeply to the design of core components such as general representation learning, synchronized audio-visual modeling, world models, and sparse modeling.

Key Research Areas:

Multimodal Unified Architecture: Native co-frequency modeling and cross-modal reasoning across vision, speech, and language.
Continual Learning & Memory Mechanisms: Architectures that separate long-term memory from the core model to enable memory recall and task transfer.
World Modeling & Causal Reasoning: Enabling models to predict environmental states, plan behaviors, and update cognitive structures dynamically.
Sparse & Modular Architectures: Scalable, efficient, and interpretable ultra-large sparse model design.
Self-Evolution & Active Data Generation: Mechanisms for self-growth through reinforcement learning, self-supervision, and environment interaction.
Cross-Modal Understanding & Generation: Strengthening joint generation and decision-making capabilities in real-world physical environments.
Intelligent Agent Capability Transfer: Systematic enhancement of task generalization and tool-composition skills.

Who We Look For

Requirements:

Expertise in Transformer-based architectures and their applications in language and multimodal domains.
Hands-on experience building or optimizing billion-scale models; familiarity with training paradigms such as SFT (Supervised Fine-Tuning), RLHF (Reinforcement Learning from Human Feedback), and self-supervised learning.
Preferred qualifications include a deep understanding of, or practical experience in, one or more of the following areas: multimodal models (e.g., vision-language models, audio-video models); reinforcement learning and autonomous agent systems; complex reasoning and planning (e.g., search + LLMs, world modeling); sparse modeling and dynamic routing mechanisms.
Strong engineering and systems-thinking capabilities, with the ability to translate cutting-edge research into production-level AGI model systems.
Publications in top-tier conferences or journals such as NeurIPS, ICLR, CVPR, and ACL are highly desirable.

Location State(s)

US-Washington-Bellevue

The expected base pay range for this position in the location(s) listed above is $141,480.00 to $265,200.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee’s tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
