AI Research Engineer - Reinforcement Learning
Tasks
- Build simulation environments and training datasets
- Debug and optimize RL training pipelines
- Define evaluation frameworks and monitor deployed systems
- Design and implement reinforcement learning algorithms
- Integrate RL agents into production systems
- Optimize policies for decision making
- Run controlled experiments and evaluate benchmarks
Perks/Benefits
- Flexible working culture
- Fully remote work
- Global team collaboration
- High impact production influence
- Work on cutting-edge AI research
Skills/Tech-stack
Actor-critic | Experiment design | Exploration/exploitation | Model Evaluation | Multimodal Learning | Online Reinforcement Learning | Policy Optimization | Policy gradients | PyTorch | Reinforcement Learning | Reward Optimization | Sample efficiency | Simulation | Text Image Audio | Training pipelines
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Related jobs
-
AI systems | Agent systems | Agentic Workflows | Autonomous AI | Autonomous AI systemsCollaborative inclusive environment | Internal mobility | Remote-friendly culture | Work-life balanceSenior-level Full TimeGermany, REMOTE, Germany R1d ago
-
Senior-level Full TimeFrance, Remote; Germany, Remote; Netherlands, Remote; … R1d ago
-
Freelance Data Science Engineer (Python & SQL) USD 116K-116KFeature Engineering | GenAI | LLMs | Langchain | MLOpsFlexible schedule | Freelance work | Part-time availability | Project-based engagementMid-level FreelanceGermany - Remote R5d ago
-
Freelance Machine Learning Engineer USD 116K-116KGenAI | Langchain | Language Models | Large Language Models | MLOpsFlexible schedule | Part-time availability | Project based workMid-level FreelanceGermany - Remote R5d ago
-
Machine Learning Developer (Freelance) USD 116K-116KLLM | Langchain | MLOps | NumPy | PandasPart-time hours | Project based workMid-level FreelanceGermany - Remote R5d ago
-
AI & Quantitative Investment Researcher (f/m/d) EUR 56K-65KAgentic AI | Artificial Intelligence | Data Analysis | Data Ingestion | Data ProcessingCareer development opportunities | Childcare facilities | Company pension | Employee share purchase plan | Hybrid work modelEntry-level Full TimeFrankfurt, DE, 60323 R5d ago
-
A/B | A/B Testing | AWS | Azure | B testingCompany laptop | Fully remote work | Home office stipend | Learning and development budget | Paid Maternity LeaveSenior-level Full TimeGermany R6d ago
-
Senior Machine Learning Engineer EUR 52K-75KAWS | Agile | Amazon SageMaker | CI/CD | Cloud PlatformsCharity donation matching | Complimentary office snacks | Flexible working hours | Health and fitness subsidy | Hybrid work modelSenior-level Full TimeBerlin, BE, Germany R6d ago
-
Junior Consultant Data Science & AI (m/w/d) EUR 36K-36KAdvanced Document Processing | Agentic Workflows | Artificial Intelligence | Cloud services | Data AnalysisAdditional equipment support | Company bike | Company car | Company fitness | Family serviceEntry-level Full Timebundesweit, Germany R9d ago
-
Agent systems | Agentic Workflows | Conversational AI | Data Pipelines | Deep learningCollaborative inclusive environment | Internal mobility | Remote-friendly culture | Work-life balanceSenior-level Full TimeGermany, REMOTE, Germany R12d ago
-
Junior Consultant Data Science & AI (m/w/d) EUR 36K-36KAdvanced Document Processing | Agentic Workflows | Artificial Intelligence | Data Analysis | Data PipelinesAdditional training | Company car | Company fitness | Family service | Hybrid workEntry-level Full Timebundesweit, Germany R12d ago
-
AWS | Apache Airflow | Automation | Bash | CI/CD30 days vacation | Flexible working hours | Health insurance | Remote work option | Sports club membershipMid-level Full TimeBerlin, Germany R14d ago
-
A/B | A/B Testing | Agent Frameworks | Android Automotive | Android Automotive OSMid-level Full TimeHybrid, Budapest, Hungary, Karlsruhe, Germany R17d ago
-
Deep learning | Generative Models | Geometric Deep Learning | Git | Graph Neural NetworksCollaboration with AI and product team | Flexible remote setup | Publication opportunity | Remote work | Thesis mentorshipSenior-level Full TimeGermany - Remote R20d ago
-
Emerging Talent - Working Student/Intern - Computational Imaging (PyTorch) 2026 (Garching) EUR 26K-30KC++ | Differentiable Rendering | Inverse problems | Neural Networks | OptimizationFlexible part-time schedule | Remote work flexibility | Thesis collaborationEntry-level Full Time Internship Part TimeDE-BY-GARCHING-SCHLEISSHEIMER STRABE 92, Germany R21d ago
-
ML Ops Engineer EUR 56K-65KAzure | CI/CD | Database Schema | Docker | GitHub ActionsCareer advancement | Flexible working hours | Portfolio submission | Remote workMid-level Full TimeGermany - Remote R26d ago
-
Generative Machine Learning Engineer EUR 55K-65KAzure | CI/CD | Database Schema | Deep learning | Deep learning pipelinesCareer advancement | Flexible working hours | Portfolio submission | Remote workMid-level Full TimeGermany - Remote R26d ago
-
AWS | Airflow | Data Quality | Data orchestration | DatabricksBVG subsidy | Corporate pension program | Employee stock option plan | Flexible working hours | HackathonsSenior-level Full TimeLiveEO GmbH Berlin Office (Hybrid) R27d ago
-
AWS | Databricks | ELT | ETL | GDALBVG subsidy | Career development | Corporate pension program | Employee stock option program | Flexible working hoursSenior-level Full TimeLiveEO GmbH Berlin Office (Hybrid) R27d ago
-
Agent systems | Autonomous Agents | Conversational AI | Data Ingestion | Deep learningInternal mobility | Remote-friendly culture | Work/life balance focusSenior-level Full TimeGermany, REMOTE, Germany R1mo ago
-
AWS | Computer Vision | Data Processing | Deep learning | DockerFun team events | Growth opportunities | Hybrid work options | Sustainable mobility optionsSenior-level Full TimeMünchen R1mo ago
-
AI Researcher (Early Talent) USD 95K-115KCommunication | Data Mining | Deep learning | Machine Learning | Model EvaluationFlexible schedule | Mentorship | Professional growth | Remote workMid-level Full TimeAmsterdam, Netherlands; Berlin, Germany; Remote - … R1mo ago
-
AI Research Engineer - Reinforcement Learning GBP 110K-140KAI software | AI software deployment | Agent systems | C++ | Control SystemsFamily support | Health and wellness benefits | Learning allowance | Parental leave | Relocation supportSenior-level Full TimeBerlin; London; Munich R1mo ago
-
Autonomous Systems | Deep learning | Language Models | Language Processing | Large Language ModelsCompetitive salaries | Inclusive culture | Remote workSenior-level Full TimeGermany, REMOTE, Germany R1mo ago
-
Agent systems | Autonomous Systems | Chatbots | Data Pipelines | Deep learningCollaborative environment | Inclusive culture | Meaningful projects | Professional growth | Remote workSenior-level Full TimeGermany, REMOTE, Germany R1mo ago