具身智能-强化学习(灵巧操作方向) 实习生
Tasks
- Design and train reward models
- Design reinforcement learning training plans on real robots
- Evaluate training results with benchmarks
- Handle real world delays noise and hardware nonlinearities
- Improve generalization and robustness in unstructured environments
- Integrate foundation models with reinforcement learning
- Iterate using real robot data
- Perform supervised fine tuning and reinforcement learning
- Track and review cutting edge embodied intelligence papers
Perks/Benefits
- N/A
Skills/Tech-stack
Actor-critic | Diffusion Models | Distributed Training | Embodied intelligence | Flow matching | Multimodal Learning | Offline Reinforcement Learning | Online Reinforcement Learning | Policy Optimization | Proximal Policy Optimization | PyTorch | Python | Reinforcement Learning | Robot Learning | Robotics Kinematics | Robotics dynamics | Soft Actor Critic
Education
Related jobs
-
DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine LearningMid-level Full Time上海7h ago
-
算法工程师-大模型数据方向 CNY 240K-360KAutomated Evaluation | Clustering | Corpus Synthesis | Data Augmentation | Data GovernanceSenior-level Full Time上海7h ago
-
数据开发工程师(Ai知识方向) CNY 180K-300KContent governance | Data Governance | Data Quality | Data Quality Metrics | ETLMid-level Full Time上海7h ago
-
Mid-level Full Time上海7h ago
-
Senior-level Full Time上海7h ago
-
Mid-level Internship上海7h ago
-
Mid-level Full Time上海7h ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KAgentic RL | DAPO | Distributed Training | Evaluation | Function CallingEntry-level Full Time上海、北京8h ago
-
Senior-level Full Time上海8h ago
-
Associate Director, Data and Analytics CNY 280K-360KApache Airflow | Automated testing | BigQuery | CI/CD | Cloud ComposerMid-level Full TimeGuangzhou, Guangdong, China15h ago
-
Entry-level Full TimeSuzhou, Jiangsu, China17h ago
-
AWS | Access Controls | Agile | Azure | CI/CDCareer growth opportunities | Continuous training | High-end technology access | Inclusive workplaceMid-level Full TimeCHN – Chengdu - Commercial, China1d ago
-
Senior System Software Engineer, Robotics CNY 144K-240KARM architecture | C# | C++ | CUDA | DeterminismSenior-level Full TimeChina, Shanghai1d ago
-
C plus plus | C# | Camera Calibration | Camera Synchronization | Camera systemsMid-level Full TimeShenzhen, Guangdong, China1d ago
-
Machine Learning Engineer CNY 216K-300KAndroid | C# | C++ | Embedded Systems | Inference OptimizationMid-level Full TimeShanghai, Shanghai, China1d ago
-
C plus plus | CUDA | Code generation | Compiler design | Domain-specific languageSenior-level Full TimeChina, Shanghai1d ago
-
Senior-level Full Time上海1d ago
-
Mid-level Full Time深圳1d ago
-
Mid-level Full Time东莞1d ago
-
Ai算法工程师 CNY 180K-300KConvolutional Neural Networks | Data Mining | Data Warehouse | Data cleaning | Data labelingMid-level Full Time东莞1d ago
-
Ai 院--多模态团队--多模态理解算法研究员-强化学习方向 CNY 240K-480KDPO | Data Preprocessing | Data cleaning | DeepSpeed | Distributed TrainingSenior-level Full Time北京 R1d ago
-
AI院-GLM团队-AI-Native 全栈工程师(偏后端) CNY 180K-300KAPI Design | API design and implementation | Cloud Native | Data Processing | Database operationsMid-level Full Time北京1d ago
-
Mid-level Full Time杭州1d ago
-
AI院--训练Infra工程师 CNY 180K-300KComputer Vision | Distributed Training | Language Models | Language Processing | Large Language ModelsMid-level Full Time北京1d ago
-
Mid-level Full Time北京1d ago