Find jobs in AI/ML, Data Science and Big Data
10 results
for Offline Reinforcement Learning
(Skill/Tech stack)
-
Helix AI Engineer, Reinforcement Learning USD 150K-350KCredit Assignment | Distributed Training | Experiment Management | Exploration | Model-based reinforcement learningIn-office collaborationSenior-level Full TimeSan Jose, CA18h ago
-
Senior Software Engineer, AI Networking USD 152K-287KBash | Bayesian optimization | C++ | Data Curation | Data Curation PipelinesSenior-level Full TimeUS, CA, Santa Clara, United States4d ago
-
Decision Intelligence Engineer - Next Best Action USD 129K-177KA3C | Backtesting | Bellman Equation | Conservative Q Learning | Constraint Mapping401k retirement savings plan | Medical, dental, and vision benefits | Occasional travel | Remote work | Time offSenior-level Full TimeRemote US, United States R7d ago
-
AI Research Scientist Intern (PhD), Embodied AI USD 93K-180KAction models | Deep learning | Diffusion Models | Fine Tuning | Imitation LearningIn office collaboration 5 days per week | Publication opportunitiesEntry-level InternshipMilpitas, CA9d ago
-
Research Intern – Reinforcement Learning (RL) INR 300K-420KAgent systems | Fine Tuning | LLM Fine-tuning | Language Processing | Learning environmentsEntry-level InternshipNoida10d ago
-
Agent systems | DPO | Deep learning | Evaluation | Fine TuningMid-level Full TimeBellevue, Washington, USA15d ago
-
Applied Reinforcement Learning Engineer USD 150K-160KActor-critic | Agent systems | BCQ | Behavioral cloning | CQLEqual opportunity employer | Hybrid remote work | Research publications opportunityMid-level Full TimeRemote Work( USA), United States R15d ago
-
Sr. Staff AI Engineer, GenAI Safety USD 191K-315KAbuse prevention | Content Safety | Evaluation Pipelines | Fine Tuning | Incident ResponseHealth and wellness programs | Time away from workSenior-level Full TimeMountain View, CA, United States15d ago
-
Robot Learning Engineering Intern USD 80K-116KBehavior Cloning | Computer Vision | Force control | Imitation Learning | Learning from DemonstrationCatered lunches | Employee Assistance Program (EAP) | Flexible PTO | Healthy snacks | Paid sick leaveEntry-level InternshipOnsite- Pittsburgh, PA16d ago
-
Sr. Staff AI Engineer, GenAI Safety USD 191K-315KAI Safety | Content Safety | Distillation | Distributed Systems | Evaluation PipelineHealth and wellness programs | Time offSenior-level Full TimeMountain View, CA, United States22d ago