Find jobs in AI/ML, Data Science and Big Data
4 results for Reinforcement learning fine tuning (Skill/Tech stack)
-
Senior AI Research Scientist
USD 139K-221K
DAPO | Fine Tuning | GRPO | Language Models | Large Language Models
Senior-level, Full Time
Remote - USA, United States R
1d ago
-
Head of World Models (Universal Robots, India)
INR 3000K-6000K
AI orchestration | Actor-critic | Agent Frameworks | Autogen | DPO
Executive-level, Full Time
Bangalore, IN
9d ago
-
Head of Simulation (Universal Robots, India)
INR 3000K-6000K
AI orchestration | Actor-Critic methods | Actor-critic | Agent Frameworks | Autogen
Executive-level, Full Time
Bangalore, IN
9d ago
-
Action Chunking | Behavioral cloning | Data Versioning | Diffusion Models | Domain Randomization
401k retirement plan | Comprehensive medical, dental and vision coverage | Daily free lunch | Employee referral bonuses | Flexible PTO
Mid-level, Full Time
Columbus, Ohio R
23d ago