aijobs.net

大模型算法工程师(开放域对话)

上海、北京

CNY 180K-300K (estimate) Mid-level Internship

Apply Save
Found 7h ago
Tasks
Perks/Benefits
Skills/Tech-stack

A/B | A/B Testing | Agentic reinforcement learning | B testing | DeepSpeed | DisTaTch State Tracking DST | Distributed Training | Function Calling | Group Relative Policy Optimization | Group Relative Policy Optimization GRPO | Knowledge Distillation | LLM | Language Models | Large Language Models | Online Experimentation | OpenRLHF | Prompt engineering | Proximal Policy Optimization | Proximal Policy Optimization PPO | Python | Quantization | RLAIF | RLHF | React | Reinforcement Learning | Reward Modeling | Supervised Fine Tuning | Supervised Fine-Tuning (SFT) | Thought Intermediate Result | VLLM

Education

Bachelor of Arts | Bachelor of Engineering | Bachelor of Science

Roles

Engineer | Language Model Engineer | Large Language Model Engineer | Learning Engineer | Machine Learning Engineer | Model Engineer

Regions

Asia/Pacific

Countries

China

States

Shanghai, CN | Beijing, CN

Cities

Shanghai, Shanghai, CN | Beijing, Beijing, CN

Apply Save
Language: zh Views: 1 Clicks: 0 Saves: 0

Related jobs