Large Model Algorithm Engineer (Open-Domain Dialogue)
Tasks
- Accelerate inference with quantization and distillation
- Build end-to-end dialogue datasets
- Clean and deduplicate raw corpora
- Deploy low-latency models to cloud and edge environments
- Develop LLM algorithms for open-domain dialogue
- Run offline evaluation and A/B testing
- Improve intent recognition and personalization
- Improve multi-turn dialogue planning and decision-making
- Optimize models with RLHF
- Optimize prompts with prompt engineering
- Reduce tool-use hallucinations
- Run supervised fine-tuning (SFT) on base models
- Track multi-turn dialogue state
- Build reward-model training datasets for reinforcement learning
Perks/Benefits
- N/A
Skills/Tech-stack
A/B Testing | AI Feedback | Dataset cleaning | Deduplication | DeepSpeed | Dialogue State Tracking | Distributed Training | Function Calling | GRPO | Human Feedback | Inference acceleration | Knowledge Distillation | Language Models | Large Language Models | Learning from Human Feedback | Offline evaluation | OpenRLHF | PPO | Prompt engineering | Python | Quantization | ReAct | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Reward Modeling | SFT | State tracking | Thought Intermediate Result | Tool use | User Simulator | vLLM | World Model
Education
Bachelor of Engineering | Bachelor of Science | Master of Science