aijobs.net

大模型Post-Training 算法工程师

上海

Mid-level Full Time

Apply Save
Found 6h ago
Tasks
Perks/Benefits
Skills/Tech-stack

DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine Learning | Mixture of Experts | PPO | Preference optimization | PyTorch | Python | RLAIF | RLHF | Reinforcement Learning | Reinforcement learning algorithms | SFT | Tool Using | Transformer

Education

Master of Engineering | Master of Science

Roles

AI | AI Research Engineer | Engineer | Learning Engineer | Machine Learning Engineer | Research Engineer

Regions

Asia/Pacific

Countries

China

States

Shanghai, CN

Cities

Shanghai, Shanghai, CN

Apply Save
Language: zh Views: 1 Clicks: 0 Saves: 0

Related jobs