Find jobs in AI/ML, Data Science and Big Data
5 results
for OpenRLHF
(Skill/Tech stack)
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KAgentic RL | DAPO | Distributed Training | Function Calling | GRPOEntry-level Full Time上海、北京3h ago
-
Developer Advocate – Reinforcement Learning USD 152K-287KApplication development | Discord | Forums | Instruction following | JAXEmployee benefits | Equity | Travel for conferencesMid-level Full TimeUS, CA, Santa Clara, United States4d ago
-
大模型算法工程师(开放域对话) CNY 180K-300KDPO | Deep learning | DeepSpeed | Distributed Training | Function CallingInternshipMid-level Internship上海9d ago
-
Entry-level Internship上海27d ago
-
Automation | Deep learning | DeepSpeed | Fine Tuning | Hugging FaceContinuous learning opportunities | Networking opportunities | Professional development | Work from homeEntry-level Full TimeAsia1mo ago