aijobs.net

大模型算法工程师(开放域对话)

上海

CNY 180K-300K (estimate) Mid-level Internship

Apply Save
Found 6h ago
Tasks
Perks/Benefits
Skills/Tech-stack

AI Feedback | Agentic tool use | DPO | DST | DeepSpeed | Dialogue State Tracking | Distributed Training | Function Calling | GRPO | Human Feedback | Inference acceleration | LLM | Language Models | Large Language Models | Learning from Human Feedback | Model Distillation | Model Quantization | Multi Turn Dialogue State Tracking | Multi-turn dialogue | PPO | Prompt engineering | Python | RLAIF | RLHF | React | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Reward Modeling | SFT | State tracking | Thought Intermediate Result | Tool use | VLLM

Education

Bachelor of Engineering | Bachelor of Science

Roles

Engineer | Language Model Engineer | Learning Engineer | Machine Learning Engineer | Model Engineer | Research Scientist | Scientist

Regions

Asia/Pacific

Countries

China

States

Shanghai, CN

Cities

Shanghai, Shanghai, CN

Apply Save
Language: zh | Views: 2 | Clicks: 0 | Saves: 0

Related jobs