Researcher - Reinforcement Learning and LLM Reasoning
Edmonton, Alberta, Canada
Huawei Technologies Canada Co., Ltd.
Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices. Our team has a 12-month contract opening for a Researcher.
Responsibilities:
- Conduct cutting-edge research in Natural Language Processing and Large Language Models (LLMs), focusing on advancing model reasoning.
- Leverage Reinforcement Learning and Deep Learning to enhance the reasoning and decision-making capabilities of LLMs.
- Design, implement, and experiment with novel methods to improve model performance and efficiency in real-world applications.
- Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.
Requirements
What you'll bring to the team:
- PhD or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.
- Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).
- Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.
- Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.
- Proficiency in Python programming and strong experience with model development and optimization.
- Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.
- Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.
Perks/benefits: Conferences