Researcher - Reinforcement Learning and LLM Reasoning
Edmonton, Alberta, Canada
Huawei Technologies Canada Co., Ltd.
Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices. Our team has a 12-month contract opening for a Researcher.
Responsibilities:
- Conduct cutting-edge research in the field of Natural Language Processing / Large Language Models (LLMs), focusing on advancing model reasoning.
- Leverage Reinforcement Learning and Deep Learning to enhance the reasoning and decision-making capabilities of LLMs.
- Design, implement, and experiment with novel methods to improve model performance and efficiency in real-world applications.
- Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.
Requirements
What you'll bring to the team:
- PhD or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.
- Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).
- Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.
- Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.
- Proficiency in Python programming and strong experience with model development and optimization.
- Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.
- Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.
Categories:
Machine Learning Jobs
Research Jobs
Tags: Computer Science Deep Learning ICLR ICML LLMs Machine Learning Mathematics ML models NeurIPS NLP PhD Python PyTorch Reinforcement Learning Research RLHF TensorFlow
Perks/benefits: Conferences
Region:
North America
Country:
Canada