Find jobs in AI/ML, Data Science and Big Data
3 results for Group Relative Policy Optimization (Skill/Tech stack)
- Applied Scientist, Amazon Customer Service
  Salary: USD 142K-222K
  Skills: Agentic AI | Artificial Intelligence | Dataset curation | Direct Preference Optimization | Embedding Models
  Level: Mid-level, Full Time
  Location: Santa Clara, California, USA
  Posted: 27d ago
- Senior Applied AI Manager
  Salary: USD 170K-234K
  Skills: Agent systems | Agentic Systems | Curriculum learning | Data Deduplication | Data mixing
  Level: Senior-level, Full Time
  Location: San Mateo, CA
  Posted: 1mo ago
- Agent RL Infra Engineer
  Salary: USD 224K-356K
  Skills: AI Feedback | Active Learning | Cluster management | Continuous Learning | Data Curation
  Level: Senior-level, Full Time
  Location: US, CA, Santa Clara, United States
  Posted: 1mo ago