aijobs.net

Research Scientist, Safety Post Training

San Francisco, CA; New York, NY

USD 216K-270K Senior-level Full Time

Apply Save
Found 15h ago
Tasks
Perks/Benefits
Skills/Tech-stack

Adversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human Feedback | Learning from Human Feedback | Machine Learning | Mechanistic Interpretability | Policy Optimization | Preference optimization | Red Teaming | Reinforcement Learning | Reinforcement Learning from Human Feedback | Reward Hacking

Education

N/A

Roles

Research Scientist | Scientist

Regions

North America

Countries

United States

States

New York, US | California, US

Cities

New York City, New York, US | San Francisco, California, US

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs