Find jobs in AI/ML, Data Science and Big Data
3 results
for Reward Hacking
(Skill/Tech stack)
-
Software Engineer, RL Data USD 320K-485KAPIs | Cloud infrastructure | Command Line | Command-line Interface | Data EngineeringCompetitive benefits | Flexible work policy | Flexible working hours | Generous vacation | Parental leaveSenior-level Full TimeLondon, UK; San Francisco, CA | …20d ago
-
Audio Processing | Autoregression | Autoregressive models | Computer Vision | Deep learningRemote workSenior-level Full TimeRemote job R1mo ago
-
Research Scientist, Safety Post Training USD 216K-270KAdversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human FeedbackCommuter stipend | Comprehensive health insurance | Dental insurance | Learning and development stipend | Paid time offSenior-level Full TimeSan Francisco, CA; New York, NY1mo ago