Find jobs in AI/ML, Data Science and Big Data
9 results
for Preference Learning
(Skill/Tech stack)
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KDistributed Training | Function Calling | GRPO | Human Feedback | JSONEntry-level Full Time上海、北京3d ago
-
A/B | A/B Testing | Auction optimization | B testing | Causal InferenceCommute subsidy | Disability insurance | Employee assistance program | Employee resource groups | Employee stock ownershipEntry-level Full TimeNew York, NY, USA14d ago
-
A/B | A/B Testing | B testing | Causal Inference | Human FeedbackCommute subsidy | Comprehensive health insurance | Disability insurance | Employee assistance program | Employee resource groupsEntry-level Full TimeBellevue, WA, USA14d ago
-
A/B | A/B Testing | B testing | Causal Inference | Experimentation infrastructureCommute subsidy | Employee resource groups | Employee stock ownership | Generous vacation | Global employee assistance programEntry-level Full TimeMountain View, CA, USA14d ago
-
Principal Machine Learning Engineer USD 114K-234KAttention Mechanism | Benchmarking | CI/CD | Cloud Architecture | ContainersDental insurance | Health insurance | Life insurance | Paid parental leave | Paid sick leaveSenior-level Full TimeUnited States28d ago
-
Student Researcher (LLM Post Training – Agent & Reinforcement Learning) - 2026 Start (PhD) USD 202K-368KCoding | Data Construction | Fine Tuning | Instruction Tuning | Language ModelsInternshipEntry-level Full TimeSan Jose, California, United States1mo ago
-
Principal AI Research Scientist Post-Training Alignment CAD 123K-180KAgentic AI | Alignment research | DPO | Deep learning | Distributed TrainingSenior-level Full TimeAMER - Canada - Ontario - …1mo ago
-
AI Feedback | Agentic Systems | Direct Preference Optimization | Distributed Training | EvaluationSenior-level Full TimeAMER - United States - California … R1mo ago
-
Automatic Speech Recognition | Deep learning | Diffusion Models | Knowledge Distillation | Language Models401k matching | Disability insurance | Family-forming benefits | Flexible time off | Health insuranceMid-level Full TimeLos Gatos, United States1mo ago