(Senior) Machine Learning Engineer - Reinforcement Learning
Tasks
- Build scalable ML training systems
- Collaborate with cross functional teams to improve metrics
- Create repeatable training and evaluation loops
- Design data and evaluation systems using RL from human preferences
- Design reward and preference objectives
- Develop simulation aligned evaluation workflows
- Implement reinforcement learning algorithms
- Monitor and optimize production models
- Own production ML for fleet scale assessment
- Ship deep learning solutions for driving behavior
Perks/Benefits
- Family leave
- Free food and snacks
- Health care plan
- Life insurance
- Long-term disability
- Paid time off
- Retirement plan
- Short-term disability
Skills/Tech-stack
Data Processing | Deep learning | Distributed Training | Generative Models | Human Feedback | Language Models | Large Language Models | Large Scale Data | Large-scale | Learning from Human Feedback | Machine Learning | Offline Reinforcement Learning | Online Reinforcement Learning | PyTorch | Python | Reinforcement Learning | Reinforcement Learning from Human Feedback | Reward Modeling | Sequence Modeling | Simulation | Vision Language Models | Vision-language
Education
Related jobs
-
【集团】Nlp算法工程师 CNY 240K-360KAddress Parsing | Algorithm Design | BERT | GIS | Geographic Information SystemsEntry-level Full Time上海4h ago
-
Mid-level Full Time北京 R6h ago
-
多模态大模型算法工程师(偏Llm) CNY 500K-500KAI system engineering | Algorithm Optimization | Computer Vision | Deep learning | Fine TuningMid-level Full Time北京7h ago
-
Miclaw-端云协同调度专家 (Hybrid AI Architect) CNY 240K-480K5G | API Integration | Classifier Training | Claude 3 | Claude 3 5 APIHybrid workSenior-level Full Time北京 R7h ago
-
Audit Logging | CI/CD | Data Governance | Data Privacy | Drift DetectionSenior-level Full TimeShanghai, Shanghai, China15h ago
-
Senior AI Engineer CNY 240K-480KAgent Orchestration | Authentication | Authorization | CI Gates | CI/CDSenior-level Full TimeChina18h ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudAsynchronous culture | Friendly work environment | Hands-off management | Remote/distributed workMid-level Full TimeShanghai, China20h ago
-
Artificial Intelligence | C# | C++ | Computer Architecture | GStreamerSenior-level Full TimeChina Shanghai1d ago
-
Forward Deployed AI Engineer CNY 37K-37KAWS | Agile | Azure | BigQuery | Cloud ComputingTravel opportunitiesEntry-level Full Time Internship北京1d ago
-
Mid-level Full Time北京1d ago
-
Mid-level Full Time北京1d ago
-
Mid-level Full Time杭州1d ago
-
Entry-level Full Time深圳、上海1d ago
-
Entry-level Full Time深圳、上海、北京、中国香港1d ago
-
【26届校招】大语言模型后训练算法工程师(Foundation Model) CNY 240K-480KData loading | Distributed Training | Docker | Fine Tuning | Inference OptimizationEntry-level Full Time上海、深圳1d ago
-
Agent数据工程师-2026届 CNY 25K-37KArtificial Intelligence | Data Governance | Data Structures | Data Structures and Algorithms | Data WarehousingEntry-level Internship北京、上海1d ago
-
Agent 服务端开发实习生(AI Agent / AI App) CNY 37K-37KContainerization | Cpluspluplus | Dify | Distributed Systems | GoEntry-level Internship北京、上海1d ago
-
Entry-level Full Time上海1d ago
-
数据算法工程师(实习生) CNY 25K-37KAnomaly Filtering | C++ | Data Generation | Data Processing | Data cleaningInternshipEntry-level Internship上海1d ago
-
Entry-level Full Time北京 R1d ago
-
Agent 全栈研发工程师(前/后端)-MiMo CNY 180K-300KAPI Design | Authentication | Authorization | Browser Automation | CI/CDEntry-level Full Time北京1d ago
-
AI基础设施研发工程师(Sandbox / 容器化)-MiMo CNY 180K-420KContainerd | Distributed Systems | Docker | ELK | File SystemMid-level Full Time北京 R1d ago
-
机器学习-2026届(Devops) CNY 240K-480KData Preprocessing | Feature Engineering | Hyperparameter Tuning | Machine Learning | Model DeploymentCareer development path | High-performance computing resources | Internship opportunity | Mentor coaching | System training planSenior-level Internship重庆、北京1d ago
-
Entry-level Full Time北京1d ago
-
高级影像高级算法工程师(博士) CNY 240K-480KC++ | Computer Vision | Deep learning | Face Recognition | Image RecognitionSenior-level Full TimeShenzhen1d ago