aijobs.net

LLM Post-Training / Agentic Algorithm Engineer

Shanghai, Beijing

CNY 180K-360K (estimate) | Entry-level | Full Time

Found 9h ago
Skills/Tech-stack

Distributed Training | Function Calling | GRPO | Human Feedback | JSON | Java | Language Processing | Large Language Model | Large Language Model Agents | Learning from Human Feedback | Machine Learning | Memory | Multi Tool Calling | Multi-turn dialogue | Natural Language | Natural Language Processing | On Policy | On policy Distillation | PPO | Planning | Policy Distillation | Preference Learning | Python | RLHF | React | Reflection | Reinforcement Learning | Reinforcement Learning from Human Feedback | Reinforcement Learning from Video Feedback | Reward Modeling | Tool Integrated Reasoning | Tool-Calling | TypeScript | Video Feedback

Education

Master of Science | PhD

Roles

Agent Engineer | Engineer | LLM Agent Engineer | Learning Engineer | Machine Learning Engineer

Regions

Asia/Pacific

Countries

China

States

Shanghai, CN | Beijing, CN

Cities

Shanghai, Shanghai, CN | Beijing, Beijing, CN
