aijobs.net

Principal AI Research Scientist, Post-Training Alignment

Canada R

CAD 140K-180K (estimate) | Senior-level | Full Time

Found 22h ago
Skills/Tech-stack

AI Feedback | Agentic Workflows | Alignment research | Controllability | Direct Preference Optimization | Distributed Training | Evaluation | Experimentation | Human Feedback | Interpretability | Language Models | Large Language Models | Learning from Human Feedback | Long-horizon reasoning | Machine Learning | Model Readiness | Policy Optimization | Preference Learning | Preference optimization | Proximal Policy Optimization | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Safety | Tool use

Education

PhD

Roles

AI Research Scientist | Principal | Principal AI Research Scientist | Research Scientist | Scientist

Regions

North America

Countries

Canada
