Find jobs in AI/ML, Data Science and Big Data
15 results for Offline Reinforcement Learning (Skill/Tech stack)
-
Sr. Physical AI Research Scientist · CAD 140K-180K · AI alignment | Artificial Intelligence | Computer Vision | Constitutional AI | Continual Learning · Hybrid work schedule · Senior-level Full Time · Toronto, ON, CA · 4d ago
-
Embodied Intelligence - Reinforcement Learning (Dexterous Manipulation) Intern · CNY 25K-37K · Actor-critic | Diffusion Models | Distributed Training | Embodied intelligence | Flow matching · Entry-level Full Time Internship · Shenzhen · 7d ago
-
Machine Learning Engineer - Reinforcement Learning · USD 150K-250K · Data Processing | Deep learning | Distributed Training | Evaluation metrics | Generative Models · Dental insurance | Family leave | Free food and snacks | Health insurance | Life insurance · Senior-level Full Time · Fremont, California, United States · 12d ago
-
Machine Learning Researcher - RL and Agentic Systems · USD 190K-287K · Agentic Systems | Benchmarking | Data Validation | Dataset Quality Evaluation | Dataset quality · Mid-level Full Time · Remote R · 12d ago
-
Robot Learning Engineering Intern · USD 110K-110K · Behavior Cloning | Computer Vision | Data Annotation | Data Ingestion | Force control · Catered lunches | Employee assistance program | Flexible work arrangements | Healthy snacks | Paid parental leave · Entry-level Internship · Onsite - Pittsburgh, PA · 13d ago
-
Lead Data Scientist · SGD 120K-135K · Actor-critic | C++ | Experiment tracking | GRPC | Hyperparameter Tuning · Global team collaboration | Occasional travel for conferences and collaborations · Senior-level Full Time · Singapore · 19d ago
-
Applied Scientist, Wayve Labs · USD 147K-213K · Autoregressive models | Depth Estimation | Diffusion Models | Foundation Models | Language · Daily yoga | Enhanced parental leave | Flexible working hours | Hybrid working | Large Social Budgets · Mid-level Full Time · Sunnyvale · 23d ago
-
Applied Scientist, Wayve Labs · CAD 100K-132K · Autoregressive models | Computer Vision | Data sets | Depth Estimation | Diffusion Models · Daily yoga | Enhanced parental leave | Flexible working hours | Large Social Budgets | Onsite bar · Mid-level Full Time · Vancouver · 25d ago
-
Applied Scientist, Wayve Labs · GBP 80K-96K · Autoregressive models | Depth Estimation | Diffusion Models | Foundation Models | Human Feedback · Daily yoga | Enhanced parental leave | Flexible working hours | Onsite bar | Onsite chef · Mid-level Full Time · London · 25d ago
-
Data Processing | Deep learning | Distributed Training | Generative Models | Human Feedback · Family leave | Free food and snacks | Health care plan | Life insurance | Long-term disability · Senior-level Full Time · Fremont · 29d ago
-
Applied Scientist, Customer Behavior Analytics · USD 142K-193K · Counterfactual analysis | Deep learning | Econometrics | Generative Models | Language Models · Mid-level Full Time · Seattle, Washington, USA · 1mo ago
-
Senior Principal Machine Learning Engineer (Fulfilment) · SGD 182K-240K · Decision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic Models · Birthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leave · Executive-level Full Time · Singapore, Singapore · 1mo ago
-
Senior Principal Data Scientist (Fulfilment) · SGD 224K-252K · Decision Processes | DeepSpeed | Distributed Training | Dynamic Models | FSDP · Birthday leave | Flexible work arrangements | Life insurance | Medical insurance | Parental leave · Executive-level Full Time · Singapore, Singapore · 1mo ago
-
Group Data Scientist · INR 2500K-3500K · Ablation Studies | CP-SAT | Cloud Computing | Constraint Programming | Contextual Bandits · Senior-level Full Time · Bangalore, Karnataka, India · 1mo ago
-
Robotics & Reinforcement Learning Engineer · EUR 60K-84K · Actor-critic | Actuator modeling | Behavior Cloning | C++ | Control Systems · Annual leave | Early Friday finish | Flexible working hours | Free coffee and tea | Permanent full-time contract · Senior-level Contract Full Time · Barcelona, CT, Spain · 1mo ago