大语言模型后训练/Agentic算法工程师
Tasks
- Build LLM agent systems
- Build training and interaction environments
- Conduct error analysis and optimization
- Deploy agent capabilities in vehicle products
- Design reward modeling and preference learning
- Develop agent RL training pipelines
- Implement tool calling and multi tool reasoning
- Improve model reliability using real data
- Optimize LLM post training
- Run offline evaluation and online analysis
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic RL | DAPO | Distributed Training | Evaluation | Function Calling | GRPO | JSON | Java | Long Horizon Planning | Machine Learning | Memory | Multi-turn dialogue | Multimodal | NLP | On Policy | On policy Distillation | PPO | Planning | Policy Distillation | Preference Learning | Python | RLHF | RLVR | React | Reflection | Reinforcement Learning | Reward Modeling | Sparse Reward | Tool-Calling | TypeScript | Virtual Environment
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Regions
Countries
States
Related jobs
-
具身智能-强化学习(灵巧操作方向) 实习生 CNY 25K-37KActor-critic | Diffusion Models | Distributed Training | Embodied intelligence | Flow matchingEntry-level Full Time Internship深圳1h ago
-
DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine LearningMid-level Full Time上海3h ago
-
算法工程师-大模型数据方向 CNY 240K-360KAutomated Evaluation | Clustering | Corpus Synthesis | Data Augmentation | Data GovernanceSenior-level Full Time上海3h ago
-
数据开发工程师(Ai知识方向) CNY 180K-300KContent governance | Data Governance | Data Quality | Data Quality Metrics | ETLMid-level Full Time上海3h ago
-
Mid-level Full Time上海3h ago
-
Senior-level Full Time上海4h ago
-
Mid-level Internship上海4h ago
-
Mid-level Full Time上海4h ago
-
Senior-level Full Time上海4h ago
-
Associate Director, Data and Analytics CNY 280K-360KApache Airflow | Automated testing | BigQuery | CI/CD | Cloud ComposerMid-level Full TimeGuangzhou, Guangdong, China12h ago
-
Entry-level Full TimeSuzhou, Jiangsu, China13h ago
-
AWS | Access Controls | Agile | Azure | CI/CDCareer growth opportunities | Continuous training | High-end technology access | Inclusive workplaceMid-level Full TimeCHN – Chengdu - Commercial, China22h ago
-
Senior System Software Engineer, Robotics CNY 144K-240KARM architecture | C# | C++ | CUDA | DeterminismSenior-level Full TimeChina, Shanghai22h ago
-
C plus plus | C# | Camera Calibration | Camera Synchronization | Camera systemsMid-level Full TimeShenzhen, Guangdong, China22h ago
-
Machine Learning Engineer CNY 216K-300KAndroid | C# | C++ | Embedded Systems | Inference OptimizationMid-level Full TimeShanghai, Shanghai, China22h ago
-
C plus plus | CUDA | Code generation | Compiler design | Domain-specific languageSenior-level Full TimeChina, Shanghai1d ago
-
Senior-level Full Time上海1d ago
-
Mid-level Full Time深圳1d ago
-
Mid-level Full Time东莞1d ago
-
Ai算法工程师 CNY 180K-300KConvolutional Neural Networks | Data Mining | Data Warehouse | Data cleaning | Data labelingMid-level Full Time东莞1d ago
-
Ai 院--多模态团队--多模态理解算法研究员-强化学习方向 CNY 240K-480KDPO | Data Preprocessing | Data cleaning | DeepSpeed | Distributed TrainingSenior-level Full Time北京 R1d ago
-
AI院-GLM团队-AI-Native 全栈工程师(偏后端) CNY 180K-300KAPI Design | API design and implementation | Cloud Native | Data Processing | Database operationsMid-level Full Time北京1d ago
-
Mid-level Full Time杭州1d ago
-
AI院--训练Infra工程师 CNY 180K-300KComputer Vision | Distributed Training | Language Models | Language Processing | Large Language ModelsMid-level Full Time北京1d ago
-
MaaS-SRE/DBA CNY 240K-480KAuto Scaling | Backup and Restore | Caching | Capacity Planning | Disaster RecoveryOn-call rotation | Regular incident drillsSenior-level Full Time北京1d ago