Researcher - Reinforcement Learning and LLM Reasoning

Edmonton, Alberta, Canada

Huawei Technologies Canada Co., Ltd.

Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices.

View all jobs at Huawei Technologies Canada Co., Ltd.

Our team has a 12-month contract opening for a Researcher.

Responsibilities:

  • Conduct cutting-edge research in the field of Natural Language Processing /Large Language Models (LLMs), focusing on advancing model reasoning.
  • Leverage Reinforcement learning and Deep Learning to enhance reasoning capabilities and decision-making of LLMs.
  • Design, implement, and experiment with novel methods to improve model and performance and efficiency in real-world applications.
  • Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.

Requirements

What you'll bring to the team:

  • PhD or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.
  • Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).
  • Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.
  • Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.
  • Proficient programming skills in Python and strong experience with model development and optimization.
  • Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.
  • Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  4  0  0

Tags: Computer Science Deep Learning ICLR ICML LLMs Machine Learning Mathematics ML models NeurIPS NLP PhD Python PyTorch Reinforcement Learning Research RLHF TensorFlow

Perks/benefits: Conferences

Region: North America
Country: Canada

More jobs like this