Principal AI Research Scientist Post-Training Alignment
AMER - Canada - Ontario - Toronto - University Ave
CAD 123K-180K (estimate) Senior-level Full Time
Tasks
- Build scalable reproducible post training workflows with infrastructure teams
- Communicate technical risks limitations and trade offs to leadership
- Create evaluation frameworks for long horizon reasoning tool use agent behavior and safety
- Design experiments to improve model behavior robustness and reasoning quality
- Design novel algorithms for model reliability controllability and alignment
- Develop post training methods for foundation models
- Drive human in the loop evaluation and high quality annotation
- Establish model readiness criteria and provide go no go recommendations for releases
- Lead model analysis and interpretability efforts
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic AI | Alignment research | DPO | Deep learning | Distributed Training | Experiment design | Foundation Models | Human-in-the-loop | Language Models | Large Language Models | Machine Learning | Model Evaluation | Model Interpretability | PPO | Preference Learning | RLAIF | RLHF | Reinforcement Learning | Reinforcement Learning for Foundation Models | The Loop | Training Infrastructure
Education
Related jobs
-
Financial Crime Data Scientist CAD 81K-120KApache Spark | Azure | Data Modeling | Databricks | Generative AIFlexible vacation | Group insurance | Health and wellness reimbursement | Pension plan | TelemedicineMid-level Full Time1, Complexe Desjardins, Montréal, Canada1d ago
-
Financial Crime Data Scientist (Temporary) CAD 69K-98KAlert prioritization | Apache Spark | Azure | Data Modeling | DatabricksDefined benefit pension | Flexible vacation | Group insurance | Health and wellness reimbursement | TelemedicineMid-level Full Time TemporaryComplexe Desjardins Montréal, Canada1d ago
-
Senior-level Full TimeToronto, ON, Canada1d ago
-
Principal Data Scientist CAD 159K-309KCD4ML | Computer Vision | Data Integrity | Data Mining | Data PreprocessingCareer development support | Learning and development programs | MentorshipSenior-level Full TimeToronto, Canada3d ago
-
Senior Data Scientist CAD 120K-160KAzure DevOps | BERT | BigQuery | CI/CD | Champion ChallengerCareer development | Flexible paid time off | Health and dental coverage | Learning opportunities | Remote work environmentSenior-level Full TimeToronto, Ontario, Canada - Remote R4d ago
-
ARIMA | Accelerated Failure Time | Bayesian Inference | Bayesian Marketing Mix Modeling | Boosted ModelsHybrid work modelSenior-level Full TimeCanada - Mississauga4d ago
-
Sr. Data Scientist CAD 111K-160KArtificial Intelligence | Cybersecurity | Data Analysis | Fraud Detection | Language ProcessingSenior-level Full TimeToronto, Canada4d ago
-
Entry-level Full TimeToronto, Canada4d ago
-
Senior Data Scientist CAD 120K-150KAnomaly Detection | Bayesian optimization | Causal Inference | Constrained optimization | Counterfactual InferenceHybrid work environment | Professional developmentSenior-level Full TimeCalgary, Canada4d ago
-
Agent-based | Agent-based systems | Amazon SageMaker | Azure ML | Cloud Machine LearningDental insurance | Flexible sick leave | Flexible vacation | Health insurance | Home-office equipmentSenior-level Full TimeCanada4d ago
-
Data Scientist II CAD 91K-140KAgile | Data Modeling | Data benchmarking | Databricks | Entity ResolutionEntry-level Full TimeVancouver, Canada5d ago
-
Senior Data Scientist – Supply Chain Operations CAD 99K-132KAnomaly Detection | Data Pipelines | Databricks | ERP | HeuristicsSenior-level Full TimeMississauga, ON, CAN - 2300 Meadowvale …5d ago
-
Agentic AI | Artificial Intelligence | Data Science | Decision Support Systems | Decision supportCorporate discounts | Development opportunities | Employee pension plan | Flexible work environment | On-site gymSenior-level Full TimeToronto, ON, Canada5d ago
-
Data Scientist, Product Analytics CAD 80K-110KA/B | A/B Testing | Amazon Redshift | Apache Superset | B testingBirthday day off | Parental leave program | Remote-first | Travel up to 3 months | Work from home stipendMid-level Full TimeCanada - Remote R5d ago
-
Oliver Wyman - Data & Analytics - Analyst - Toronto CAD 70K-107KCredit Risk | Credit risk modeling | Data Modeling | Data analytics | GenAIFlexible work schedule | Hybrid work | International travel | Training and mentorship | Work-life balanceEntry-level Full TimeToronto - Bremner, Canada R6d ago
-
Senior Data Scientist - Personal Lines Datahub CAD 94K-115KComputer Vision | Data Governance | Data Mining | Data Quality | GitHubDefined benefit pension | Employee stock purchase plan | Extra paid time off | Flexible work arrangements | Hybrid work modelSenior-level Full TimeMontréal, 2020 Robert-Bourassa, Canada6d ago
-
AI Engineer- Decision Science CAD 82K-154KAWS | Agentic AI | Anomaly Detection | Apache Spark | Artificial IntelligenceAccident insurance | Health insurance | Life insurance | Retirement savings plan | Tuition reimbursementSenior-level Full TimeBMOPLACE, Canada6d ago
-
Director, Decision Science - Canada CAD 132K-231KArtificial Intelligence | Bureau data | Credit Bureau Data | Credit Risk | Credit bureauAccident insurance | Health insurance | Life insurance | Retirement savings plan | Tuition reimbursementExecutive-level Full TimeBMOPLACE, Canada6d ago
-
Agile | Angular | Automated testing | Azure OpenAI | CI/CDEmployee assistance program | Health benefits | Hybrid work environment | Paid time offSenior-level Full TimeMontreal 111, Canada R6d ago
-
Senior Data Scientist USD 140K-180KAmazon SageMaker | Azure Machine Learning | Docker | EC2 | GitCompany-sponsored events | Dental insurance | Flexible sick days | Flexible vacation days | Home-office equipmentSenior-level Full TimeRemote/US & Canada R6d ago
-
Lead Data Scientist USD 160K-200KAWS | Computer Vision | Docker | EC2 | Experiment trackingCompany-sponsored events | Dental insurance | Flexible sick days | Flexible vacation | Home-office equipmentSenior-level Full TimeRemote/US & Canada R6d ago
-
Senior Data Scientist CAD 111K-160KApache Spark | Azure | Data Analysis | Databricks | Experimental DesignSenior-level Full TimeVancouver, Canada7d ago
-
Artificial Intelligence | Data Science | English | Machine Learning | Technical documentationFlexible hours | Internship program | Onsite workEntry-level Full Time InternshipCA-QC-LONGUEUIL-J01 ~ 1000 Blvd Marie-Victorin ~ …7d ago
-
Senior AI Applied Scientist II CAD 150K-177KAgentic AI | Biomarker Extraction | Classification | Contrastive Learning | Deep learningFlexible vacation policy | Free whole body scans for team members | Health, dental, vision coverage | Mental health coverage | RRSP accessSenior-level Full TimeVancouver, British Columbia, Canada7d ago
-
A/B | A/B Testing | AWS SageMaker | Airflow | Attribution ModelingEmployee and family assistance program | Health and dental benefits | In-store discount | Learning and development opportunities | Paid vacationSenior-level Full TimeToronto, ON, M5V 1X6, CAN7d ago