Researcher - Reinforcement Learning and LLM Reasoning
Edmonton, Alberta, Canada
Huawei Technologies Canada Co., Ltd.
Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices.Our team has a 12-month contract opening for a Researcher.
Responsibilities:
- Conduct cutting-edge research in the field of Natural Language Processing /Large Language Models (LLMs), focusing on advancing model reasoning.
- Leverage Reinforcement learning and Deep Learning to enhance reasoning capabilities and decision-making of LLMs.
- Design, implement, and experiment with novel methods to improve model and performance and efficiency in real-world applications.
- Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.
Requirements
What you'll bring to the team:
- PhD or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.
- Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).
- Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.
- Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.
- Proficient programming skills in Python and strong experience with model development and optimization.
- Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.
- Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
4
0
0
Categories:
Machine Learning Jobs
Research Jobs
Tags: Computer Science Deep Learning ICLR ICML LLMs Machine Learning Mathematics ML models NeurIPS NLP PhD Python PyTorch Reinforcement Learning Research RLHF TensorFlow
Perks/benefits: Conferences
Region:
North America
Country:
Canada
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Staff Machine Learning Engineer jobsData Scientist II jobsPrincipal Data Engineer jobsStaff Data Scientist jobsBI Developer jobsData Manager jobsJunior Data Analyst jobsResearch Scientist jobsData Science Manager jobsBusiness Data Analyst jobsLead Data Analyst jobsData Engineer III jobsSenior AI Engineer jobsData Specialist jobsData Science Intern jobsSr. Data Scientist jobsPrincipal Software Engineer jobsData Analyst Intern jobsAzure Data Engineer jobsSoftware Engineer II jobsData Analyst II jobsBI Analyst jobsSoftware Engineer, Machine Learning jobsJunior Data Engineer jobsSenior Data Scientist, Performance Marketing jobs
Snowflake jobsLinux jobsEconomics jobsOpen Source jobsBanking jobsHadoop jobsJavaScript jobsComputer Vision jobsRDBMS jobsPhysics jobsKafka jobsData Warehousing jobsMLOps jobsAirflow jobsNoSQL jobsKPIs jobsR&D jobsGoogle Cloud jobsScala jobsOracle jobsData warehouse jobsStreaming jobsClassification jobsPostgreSQL jobsGitHub jobs
Scikit-learn jobsSAS jobsCX jobsTerraform jobsScrum jobsPandas jobsPySpark jobsData Mining jobsDistributed Systems jobsRobotics jobsIndustrial jobsBigQuery jobsLooker jobsJira jobsUnstructured data jobsRedshift jobsJenkins jobsE-commerce jobsdbt jobsReact jobsMicroservices jobsPharma jobsData strategy jobsMySQL jobsNumPy jobs