Find jobs in AI/ML, Data Science and Big Data
9 results for Preference Learning (Skill/Tech stack)
-
LLM Post-Training / Agentic Algorithm Engineer
CNY 180K-360K
Agentic RL | DAPO | Distributed Training | Evaluation | Function Calling
Entry-level Full Time
Shanghai, Beijing
6d ago
-
AI Feedback | Agentic Workflows | Alignment research | Controllability | Direct Preference Optimization
Advanced AI research tooling | Flexible work arrangements | Health dental and wellness coverage | Hybrid work | Large scale compute access
Senior-level Full Time
Canada R
6d ago
-
Principal Machine Learning Engineer
USD 114K-234K
Attention Mechanism | Benchmarking | CI/CD | Cloud Architecture | Containers
Dental insurance | Health insurance | Life insurance | Paid parental leave | Paid sick leave
Senior-level Full Time
United States
7d ago
-
Student Researcher (LLM Post Training – Agent & Reinforcement Learning) - 2026 Start (PhD)
USD 202K-368K
Coding | Data Construction | Fine Tuning | Instruction Tuning | Language Models
Internship
Entry-level Full Time
San Jose, California, United States
11d ago
-
Principal AI Research Scientist Post-Training Alignment
CAD 123K-180K
Agentic AI | Alignment research | DPO | Deep learning | Distributed Training
Senior-level Full Time
AMER - Canada - Ontario - …
13d ago
-
AI Feedback | Agentic Systems | Direct Preference Optimization | Distributed Training | Evaluation
Senior-level Full Time
AMER - United States - California … R
14d ago
-
Automatic Speech Recognition | Deep learning | Diffusion Models | Knowledge Distillation | Language Models
401k matching | Disability insurance | Family-forming benefits | Flexible time off | Health insurance
Mid-level Full Time
Los Gatos, United States
1mo ago
-
Data Analysis | Dataset Processing | Direct Preference Optimization | Evaluation Pipelines | Fine Tuning
Entry-level Internship
San Jose, California, United States
1mo ago
-
Applied Scientist, AGI Customization Services
USD 142K-193K
Amazon SageMaker | Continued Pre Training | Dataset creation | Experiment design | Extended Post Training
Mid-level Full Time
Cambridge, Massachusetts, USA
1mo ago