Find jobs in AI/ML, Data Science and Big Data
3 results for Group Relative Policy Optimization (Skill/Tech stack)
- Senior Applied AI Manager
  USD 170K-234K
  Agent systems | Agentic Systems | Curriculum learning | Data Deduplication | Data mixing
  Senior-level, Full Time
  San Mateo, CA
  9d ago
- Agent RL Infra Engineer
  USD 224K-356K
  AI Feedback | Active Learning | Cluster management | Continuous Learning | Data Curation
  Senior-level, Full Time
  US, CA, Santa Clara, United States
  11d ago
- Automated testing | Cryptography | Direct Preference Optimization | Distributed Systems | Docker
  Senior-level, Full Time
  Remote R
  22d ago