Find jobs in AI/ML, Data Science and Big Data
1 result
for Trust Region Policy Optimization
(Skill/Tech stack)
-
Applied Reinforcement Learning Engineer USD 150K-160KActor-critic | Agent systems | BCQ | Behavioral cloning | CQLEqual opportunity employer | Hybrid remote work | Research publications opportunityMid-level Full TimeRemote Work( USA), United States R2d ago