AI Research Engineer - Reinforcement Learning
Tasks
- Build reinforcement learning experiments
- Define success metrics for deployed RL systems
- Design reinforcement learning algorithms
- Develop simulation environments
- Document experimental findings
- Evaluate reinforcement learning experiments
- Integrate reinforcement learning agents into production systems
- Monitor deployed reinforcement learning systems
- Optimize reinforcement learning pipelines
Skills/Tech-stack
Actor-Critic methods | Computational Efficiency | Exploration/exploitation | Machine Learning | Multi-modal AI | NLP | Online Reinforcement Learning | Policy Divergence | Policy Optimization | Policy gradients | PyTorch | Reinforcement Learning | Reward Optimization | Sample efficiency
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD