Director, Reinforcement Learning & Agentic Post-Training

Paris, France

EUR 151K-200K (estimate), Executive-level, Full-time

Found 27d ago

Build agent training and reinforcement learning environments
Create training data generation and preference pipelines
Design evaluation frameworks for multi-step workflows
Develop reward models and verifiers
Improve policies using supervised fine-tuning, preference optimization, and reinforcement learning
Lead reinforcement learning strategy for LLM agents
Measure tool-call correctness and workflow completion
Mentor machine learning engineers and review technical designs
Partner with product and domain experts to build trainable agent environments
Set engineering standards for experiment tracking and reproducibility

