aijobs.net

Principal AI Research Scientist Post-Training Alignment

AMER - Canada - Ontario - Toronto - University Ave

CAD 123K-180K (estimate) Senior-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Agentic AI | Alignment research | DPO | Deep learning | Distributed Training | Experiment design | Foundation Models | Human-in-the-loop | Language Models | Large Language Models | Machine Learning | Model Evaluation | Model Interpretability | PPO | Preference Learning | RLAIF | RLHF | Reinforcement Learning | Reinforcement Learning for Foundation Models | The Loop | Training Infrastructure

Education

PhD

Roles

AI Research Scientist | Principal | Principal AI Research Scientist | Research Scientist | Scientist

Regions

North America

Countries

Canada

States

Ontario, CA

Cities

Toronto, Ontario, CA

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs