Find jobs in AI/ML, Data Science and Big Data
2 results
for Off-policy evaluation
(Skill/Tech stack)
-
Agent systems | DPO | Deep learning | Evaluation | Fine TuningMid-level Full TimeBellevue, Washington, USA19d ago
-
Principal Applied Scientist, Agentic AI USD 181K-305KAI Feedback | DPO | Fine Tuning | Human Feedback | Learning from Human FeedbackMentorship and technical leadership | Remote-first work environmentSenior-level Full TimeRemote-USA, United States R22d ago