(Senior) Machine Learning Engineer - Reinforcement Learning
Tasks
- Build scalable ML training systems
- Collaborate with cross functional teams to improve metrics
- Create repeatable training and evaluation loops
- Design data and evaluation systems using RL from human preferences
- Design reward and preference objectives
- Develop simulation aligned evaluation workflows
- Implement reinforcement learning algorithms
- Monitor and optimize production models
- Own production ML for fleet scale assessment
- Ship deep learning solutions for driving behavior
Perks/Benefits
- Family leave
- Free food and snacks
- Health care plan
- Life insurance
- Long-term disability
- Paid time off
- Retirement plan
- Short-term disability
Skills/Tech-stack
Data Processing | Deep learning | Distributed Training | Generative Models | Human Feedback | Language Models | Large Language Models | Large Scale Data | Large-scale | Learning from Human Feedback | Machine Learning | Offline Reinforcement Learning | Online Reinforcement Learning | PyTorch | Python | Reinforcement Learning | Reinforcement Learning from Human Feedback | Reward Modeling | Sequence Modeling | Simulation | Vision Language Models | Vision-language
Education
Related jobs
-
Machine Learning Engineer CNY 300K-380KArtifact tracking | Data Lineage | Data Pipelines | Distributed Systems | DockerFitness Events | Free meals | Hybrid working | Paid time off | Volunteer opportunitiesMid-level Full TimeShanghai, China7h ago
-
机器学习平台研发工程师/专家 CNY 240K-360KDebugging | Distributed Training | Docker | Elastic scaling | Fault ToleranceSenior-level Full Time北京、上海14h ago
-
机器人 Vln 大模型导航-实习生 CNY 25K-37KArtificial Intelligence | C++ | CUDA | Computer Vision | Data PipelinesOnsite workEntry-level Internship北京15h ago
-
Entry-level Internship南京15h ago
-
Entry-level Internship南京15h ago
-
Entry-level Internship南京15h ago
-
nlp算法工程师-2027届 CNY 25K-37KDeep learning | DeepSpeed | Information Retrieval | Intent Recognition | Language ProcessingInternshipEntry-level Internship武汉15h ago
-
Entry-level Full Time上海15h ago
-
Entry-level Internship深圳、上海16h ago
-
Entry-level Internship深圳16h ago
-
Entry-level Internship北京16h ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | LLM Agent | Machine Learning | PyTorch | RLHFConference participation | Internship experience | Research mentorshipEntry-level Internship深圳16h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GRPOEntry-level Internship深圳16h ago
-
具身智能算法实习生(Vla预训练方向) CNY 25K-37KCLIP | Deep learning | LLaVA | Language Models | Large Language ModelsEntry-level Internship深圳16h ago
-
AI Agent 开发实习生(通用智能仿真方向) CNY 25K-37KAPI | API Integration | Agent architecture | Agent systems | Asynchronous programmingEntry-level Internship广州16h ago
-
Apache Airflow | Apache Spark | Automated testing | Data Lakes | Data WarehousesCommute subsidy | Disability insurance | Employee assistance program | Employee resource groups | Employee stock ownershipSenior-level Full TimeShanghai, China1d ago
-
Embedded Base Software Testing Engineer- Intern CNY 74K-100KC# | CAN | Excel | Hardware-in-the-loop | I2CEntry-level Full Time InternshipWuhan, Hubei, China1d ago
-
Senior Software Engineer (RAG Backend Developer) CNY 120K-180KA/B | A/B Testing | ABAC | Audit Logging | B testingSenior-level Full TimeGuangzhou, Guangdong, China R1d ago
-
Embedded Base Software Testing Engineer- Intern CNY 74K-100KC# | CAN | Excel | Hardware-in-the-loop | I2CEntry-level Full Time InternshipWuhan, Hubei, China1d ago
-
Magnetic Recording Algorithm Development Engineer CNY 150K-240KAlgorithm Development | Automated Test | Automated Test Equipment | C# | C++Senior-level Full TimeShenzhen, Guangdong Province, China1d ago
-
Assistant Manager, Data Platform Delivery CNY 300K-406KARMA | Amazon SageMaker | Association rule | Association rule learning | AzureMid-level Full TimeChina - Guangzhou1d ago
-
Mid-level Full TimeShanghai, Shanghai, China1d ago
-
Senior-level Full TimeShenyang - PIC, China1d ago
-
Senior-level Full TimeShenyang - PIC, China1d ago
-
Mid-level Full Time深圳2d ago