Find jobs in AI/ML, Data Science and Big Data
1 result
for Reinforcement Learning From Human Feedback Optimization
(Skill/Tech stack)
-
Senior Applied AI Researcher (Brazil) BRL 271K-370KCI/CD | DPO | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeBrazil/Remote R7d ago