具身智能-强化学习(灵巧操作方向) 实习生
Tasks
- Design and train reward models
- Design reinforcement learning training plans on real robots
- Evaluate training results with benchmarks
- Handle real world delays noise and hardware nonlinearities
- Improve generalization and robustness in unstructured environments
- Integrate foundation models with reinforcement learning
- Iterate using real robot data
- Perform supervised fine tuning and reinforcement learning
- Track and review cutting edge embodied intelligence papers
Perks/Benefits
- N/A
Skills/Tech-stack
Actor-critic | Diffusion Models | Distributed Training | Embodied intelligence | Flow matching | Multimodal Learning | Offline Reinforcement Learning | Online Reinforcement Learning | Policy Optimization | Proximal Policy Optimization | PyTorch | Python | Reinforcement Learning | Robot Learning | Robotics Kinematics | Robotics dynamics | Soft Actor Critic
Education
Related jobs
-
机器学习平台研发工程师/专家 CNY 240K-360KDebugging | Distributed Training | Docker | Elastic scaling | Fault ToleranceSenior-level Full Time北京、上海10h ago
-
机器人 Vln 大模型导航-实习生 CNY 25K-37KArtificial Intelligence | C++ | CUDA | Computer Vision | Data PipelinesOnsite workEntry-level Internship北京11h ago
-
Entry-level Internship南京11h ago
-
Entry-level Internship南京11h ago
-
Entry-level Internship南京11h ago
-
nlp算法工程师-2027届 CNY 25K-37KDeep learning | DeepSpeed | Information Retrieval | Intent Recognition | Language ProcessingInternshipEntry-level Internship武汉11h ago
-
Entry-level Full Time上海11h ago
-
Entry-level Internship深圳、上海12h ago
-
Entry-level Internship深圳12h ago
-
Entry-level Internship北京12h ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | LLM Agent | Machine Learning | PyTorch | RLHFConference participation | Internship experience | Research mentorshipEntry-level Internship深圳12h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GRPOEntry-level Internship深圳12h ago
-
具身智能算法实习生(Vla预训练方向) CNY 25K-37KCLIP | Deep learning | LLaVA | Language Models | Large Language ModelsEntry-level Internship深圳12h ago
-
AI Agent 开发实习生(通用智能仿真方向) CNY 25K-37KAPI | API Integration | Agent architecture | Agent systems | Asynchronous programmingEntry-level Internship广州12h ago
-
Apache Airflow | Apache Spark | Automated testing | Data Lakes | Data WarehousesCommute subsidy | Disability insurance | Employee assistance program | Employee resource groups | Employee stock ownershipSenior-level Full TimeShanghai, China23h ago
-
Embedded Base Software Testing Engineer- Intern CNY 74K-100KC# | CAN | Excel | Hardware-in-the-loop | I2CEntry-level Full Time InternshipWuhan, Hubei, China23h ago
-
Senior Software Engineer (RAG Backend Developer) CNY 120K-180KA/B | A/B Testing | ABAC | Audit Logging | B testingSenior-level Full TimeGuangzhou, Guangdong, China R1d ago
-
Embedded Base Software Testing Engineer- Intern CNY 74K-100KC# | CAN | Excel | Hardware-in-the-loop | I2CEntry-level Full Time InternshipWuhan, Hubei, China1d ago
-
Magnetic Recording Algorithm Development Engineer CNY 150K-240KAlgorithm Development | Automated Test | Automated Test Equipment | C# | C++Senior-level Full TimeShenzhen, Guangdong Province, China1d ago
-
Assistant Manager, Data Platform Delivery CNY 300K-406KARMA | Amazon SageMaker | Association rule | Association rule learning | AzureMid-level Full TimeChina - Guangzhou1d ago
-
Mid-level Full TimeShanghai, Shanghai, China1d ago
-
Senior-level Full TimeShenyang - PIC, China1d ago
-
Mid-level Full Time深圳2d ago
-
Mid-level Full Time深圳2d ago
-
Senior-level Full TimeShanghai, CN, 2012033d ago