Researcher - Reinforcement Learning and LLM Reasoning

Huawei Technologies Canada Co., Ltd.

Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices.

Find more jobs like this Jobs in Canada

Posted 9 months ago

Our team has a 12-month contract opening for a Researcher.

Responsibilities:

Conduct cutting-edge research in the field of Natural Language Processing /Large Language Models (LLMs), focusing on advancing model reasoning.
Leverage Reinforcement learning and Deep Learning to enhance reasoning capabilities and decision-making of LLMs.
Design, implement, and experiment with novel methods to improve model and performance and efficiency in real-world applications.
Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.

What you'll bring to the team:

PhD or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.
Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).
Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.
Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.
Proficient programming skills in Python and strong experience with model development and optimization.
Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.
Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 4 0 0

Categories: Machine Learning Jobs Research Jobs