Find jobs in AI/ML, Data Science and Big Data
15 results
for Offline Reinforcement Learning
(Skill/Tech stack)
-
Applied Scientist, Wayve Labs USD 147K-213KAutoregressive models | Depth Estimation | Diffusion Models | Foundation Models | LanguageDaily yoga | Enhanced parental leave | Flexible working hours | Hybrid working | Large Social BudgetsMid-level Full TimeSunnyvale2d ago
-
Applied Scientist, Wayve Labs CAD 100K-132KAutoregressive models | Computer Vision | Data sets | Depth Estimation | Diffusion ModelsDaily yoga | Enhanced parental leave | Flexible working hours | Large Social Budgets | Onsite barMid-level Full TimeVancouver5d ago
-
Applied Scientist, Wayve Labs GBP 80K-96KAutoregressive models | Depth Estimation | Diffusion Models | Foundation Models | Human FeedbackDaily yoga | Enhanced parental leave | Flexible working hours | Onsite bar | Onsite chefMid-level Full TimeLondon5d ago
-
Sr. Staff Software Engineer, Machine Learning USD 191K-315KContent Safety | Evaluation Pipelines | Fine Tuning | Incident Response | Incident monitoringHealth and wellness programs | Hybrid work environment | Time away from workSenior-level Full TimeMountain View, CA, United States5d ago
-
Sr. Staff Software Engineer, Machine Learning USD 191K-315KContent Safety | Deep learning | Evaluation Pipelines | Fine Tuning | Harm TaxonomyHealth and wellness programs | Time away from workSenior-level Full TimeMountain View, CA, United States7d ago
-
Data Processing | Deep learning | Distributed Training | Generative Models | Human FeedbackFamily leave | Free food and snacks | Health care plan | Life insurance | Long-term disabilitySenior-level Full Time费利蒙9d ago
-
Applied Scientist, Customer Behavior Analytics USD 142K-193KCounterfactual analysis | Deep learning | Econometrics | Generative Models | Language ModelsMid-level Full TimeSeattle, Washington, USA16d ago
-
Senior Principal Machine Learning Engineer (Fulfilment) SGD 182K-240KDecision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic ModelsBirthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore20d ago
-
Senior Principal Data Scientist (Fulfilment) SGD 224K-252KDecision Processes | DeepSpeed | Distributed Training | Dynamic Models | FSDPBirthday leave | Flexible work arrangements | Life insurance | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore20d ago
-
Group Data Scientist INR 2500K-3500KAblation Studies | CP-SAT | Cloud Computing | Constraint Programming | Contextual BanditsSenior-level Full TimeBangalore, Karnataka, India28d ago
-
Robotics & Reinforcement Learning Engineer EUR 60K-84KActor-critic | Actuator modeling | Behavior Cloning | C++ | Control SystemsAnnual leave | Early Friday finish | Flexible working hours | Free coffee and tea | Permanent full-time contractSenior-level Contract Full TimeBarcelona, CT, Spain1mo ago
-
Helix AI Engineer, Reinforcement Learning USD 150K-350KCredit Assignment | Distributed Training | Experiment Management | Exploration | Model-based reinforcement learningIn-office collaborationSenior-level Full TimeSan Jose, CA1mo ago
-
Senior Software Engineer, AI Networking USD 152K-287KBash | Bayesian optimization | C++ | Data Curation | Data Curation PipelinesSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Research Intern – Reinforcement Learning (RL) INR 300K-420KAgent systems | Fine Tuning | LLM Fine-tuning | Language Processing | Learning environmentsEntry-level InternshipNoida1mo ago
-
Applied Reinforcement Learning Engineer USD 150K-160KActor-critic | Agent systems | BCQ | Behavioral cloning | CQLEqual opportunity employer | Hybrid remote work | Research publications opportunityMid-level Full TimeRemote Work( USA), United States R1mo ago