AI Research Engineer - Reinforcement Learning
Tasks
- Build simulation environments and training datasets
- Debug and optimize reinforcement learning pipelines
- Define evaluation frameworks and monitor deployed systems
- Design and implement reinforcement learning algorithms
- Integrate reinforcement learning agents into production systems
- Run controlled experiments and evaluate benchmarks
Perks/Benefits
Skills/Tech-stack
Actor-critic | Data Analysis | Experiment design | Exploration/exploitation | Policy Optimization | Policy gradients | PyTorch | Reinforcement Learning | Sample efficiency | Simulation | Training Pipeline
Education
Related jobs
-
CNN | Computer Vision | Data Pipelines | Deep learning | Entity recognitionCareer development opportunities | Fully remote work flexibility | International collaborative environment | Learning and continuous improvement cultureMid-level Full TimeFrance R1d ago
-
Senior-level Full TimeFrance, Remote; Germany, Remote; Netherlands, Remote; … R1d ago
-
Senior-level Full TimeFrance, Remote R4d ago
-
AI Engineer EUR 0K-0KAgentic Systems | Deep learning | Fine Tuning | Language Models | Language ProcessingHybrid work | Paid internship | Paid trial project | Remote work within FranceEntry-level Full TimeParis, IDF, FR / Paris, Île-de-France, … R4d ago
-
Ingénieur ML/LLM - Paris - H/F EUR 26K-28KAWS | Data Pipelines | Fine Tuning | GCP | Hugging FaceBike parking | Company restaurants and cafeteria | Flexible Mobility Opportunities | Health insurance | Meal allowanceEntry-level Full TimeParis, IDF, France R5d ago
-
Freelance Data Science Engineer (Python & SQL) USD 116K-116KBig Data | Big data processing | Data Analysis | Data Ingestion | Data ProcessingEnglish submission requirement C1 plus | Flexible weekly hours | Freelance project-based work | Part-time availabilityMid-level FreelanceFrance - Remote R5d ago
-
Machine Learning Developer (Freelance) USD 116K-116KLangchain | MLOps | Model Deployment | NumPy | PandasPaid per completed tasks | Part-time project work | Project-based engagementMid-level FreelanceFrance - Remote R5d ago
-
Freelance Machine Learning Engineer USD 116K-116KLLMs | Langchain | MLOps | Machine Learning | NumPyFlexible hours | Part-time availability | Project based workMid-level FreelanceFrance - Remote R5d ago
-
Consultant(e) Senior Data Science & IA EUR 50K-60KAPI Development | AWS | Azure | CI/CD | Cloud ComputingRemote work | Training opportunities | Work with international teamsSenior-level Full TimeParis, IDF, France R6d ago
-
AI Engineer (F/H/X) EUR 30K-60KAPIs | AWS | Batching | CI/CD | CNNEmployee welfare committee | Health insurance | Meal benefits | Retirement benefits | TeleworkMid-level Full TimeFR - Antony Headquarters, France R7d ago
-
ML Engineer (W/M/D) EUR 51K-65KAPIs | AWS SageMaker | Azure ML | CI/CD | DVCAdditional paid days off | Family care policy | Flexible work environment | Free yoga lessons | Health insuranceMid-level Full TimeParis, IDF, France R15d ago
-
Senior Backend Engineer | Python - Celery | IA & Machine Learning | Paris ou Remote Partiel A EUR 55K-65KAzure | Celery | Django | Docker | ElasticsearchBSPCE equity | Health insurance | Meal vouchers | RTT | Telework 2 days per weekSenior-level Full TimeParis, France R25d ago
-
AI Software Engineer EUR 85K-102KAI Security | Agile | Autogen | Autonomous Agents | Bias MitigationCo-working space reimbursement | Conference attendance | Gym membership | Home office support | Medical coverageMid-level Full TimeParis, Remote, FR R1mo ago
-
Senior Machine Learning Engineer EUR 60K-80KLLMs | Language Processing | LightGBM | Machine Learning | Model DeploymentFree books | Paid sabbatical | Stock options | Team building eventsSenior-level Full TimeRemote (France) R1mo ago
-
AWS GCP Azure | CI/CD | Cloud Platforms | Cloud Platforms (AWS | Cloud platforms AWS GCPConference support | Fully remote | Health benefits | Language courses discount | Paid vacationMid-level Full TimeFrance R1mo ago
-
AI infrastructure | Data Analysis | Experimentation | Language Models | Large Language ModelsAnnual bonus | Global retreats | Learning stipend | Medical coverage | Paid time offSenior-level Full TimeFrance R1mo ago
-
Data Analysis | Generative AI | Langchain | MLOps | Machine LearningFlexible hours | Professional development | Project impact | Remote workMid-level FreelanceFrance - Remote R1mo ago
-
Data Analysis | Data Processing | Generative AI | LLMs | LangchainFlexible schedule | Part-time engagement | Remote work | Skill developmentMid-level FreelanceFrance - Remote R1mo ago