AI Engineer - Reinforcement Learning
Tasks
- Build data pipelines for training and human feedback collection
- Create evaluation frameworks to measure agent performance
- Design and implement reinforcement learning environments for decision making
- Develop reward functions for agent objectives
- Document training findings and failure modes
- Stay current with industry trends
Perks/Benefits
- N/A
Skills/Tech-stack
Data Pipelines | Evaluation Frameworks | Fine Tuning | Human-in-the-loop | Language model fine-tuning | Large Language Model | Large Language Model Fine-Tuning | Learning environments | Model Fine-tuning | Policy Optimization | PyTorch | Python | RLHF | Reinforcement Learning | Reinforcement Learning Environments | Reward Modeling | Reward shaping | The Loop
Education
N/A
Roles
AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer
Related jobs
-
APIs | Azure Data | Azure Data Factory | Data Factory | Data Ingestion2 days remote work | 3 days on-site | CDI contract | Hybrid workMid-level Full TimeParis, France R10h ago
-
Tech Lead Cloud & Data Engineer (F/H) EUR 50K-65KAgile | Amazon Redshift | Amazon S3 | Azure | Azure DataEmployee stock ownership plan | Health insurance | Maternity Leave Supplement | Paid vacation bonus | Training opportunitiesSenior-level Full TimeLille, Hauts-de-France, France R10h ago
-
Agile | Apache Hadoop | Apache Spark | CI/CD | Cloud ComputingContinuous learning programs | Employee representative council | Health insurance | Meal vouchers | Profit sharingMid-level Full TimeParis, IDF, France R11h ago
-
AI Engineer (All Genders) EUR 50K-60KAI Safety | API Development | Auditability | Batching | CI/CDCareer development support | Coparental Leave Extended | Fitness subscription | High-end health insurance | Hybrid workMid-level Full TimeIssy-les-Moulineaux, IDF, France12h ago
-
Agile | Apache Hadoop | Apache Hive | Apache Kafka | Apache SparkEmployee representative council | Health insurance | Meal vouchers | Profit sharing | Referral bonusMid-level Full TimeParis, IDF, France R12h ago
-
Apache Hadoop | Apache Hive | Apache Kafka | Apache Spark | Azure HDInsightEmployee Committee | Health insurance | Holiday bonus | Meal vouchers | Profit sharingSenior-level Full TimeCourbevoie, IDF, France R12h ago
-
Consultant·e AI Engineer – Senior / Confirmé·e EUR 50K-60KAzure | Bash | Cloud platform | Foundation Models | Generative AIAnnual certification plan | Annual seminar | Career development | Health insurance 75% | Internal coachingSenior-level Full TimeParis, France13h ago
-
Machine Learning Engineer (W/M/D) EUR 48K-60KAWS SageMaker | Artificial Intelligence | Azure ML | CI/CD | DVCAdditional paid days off | Annual learning budget | Family care policy | Flexible work schedule | Free therapy or coaching sessionsMid-level Full TimeParis, IDF, France R1d ago
-
Apache Airflow | Apache Beam | BigQuery | Cloud Run | Cloud StorageFlexible hours | Fully remote work | Health benefits | Professional development opportunitiesSenior-level Full TimeFrance R1d ago
-
Senior Data Engineer - Real time analytics EUR 69K-88KAWS | AWS Glue | Amazon EKS | Amazon EMR | Amazon KinesisCollaborative team | Commute support | Continuous learning | Health coverage | Lunch allowanceSenior-level Full TimeParis Office1d ago
-
Data Engineer - H/F EUR 50K-58KAWS Glue | Ansible | Azure Data | Azure Data Factory | Azure DevOpsEmployee stock ownership plan | Health insurance | Maternity return support | Paid time off bonus | TeleworkMid-level Full TimeRennes, Brittany, France R1d ago
-
Data Engineer Spark / Scala - H/F EUR 50K-58KAWS Glue | Ansible | Azure Data | Azure Data Factory | Azure DevOpsEmployee share ownership | Health insurance coverage | Maternity leave return with reduced schedule without salary loss | Paid vacation bonus | Training programsMid-level Full TimeNantes, Pays de la Loire, France R1d ago
-
Data Engineer - Débutant H/F EUR 30K-34KAPI | Computer Vision | Data pipeline | Language Models | Language ProcessingEntry-level Full TimePAU-CSTJF(FRA), PAU, France1d ago
-
3D Computer Vision | 3D Geometry | C++ | CUDA | Computer VisionEntry-level Full TimeParis1d ago
-
Agentic AI Intern EUR 25K-25KAutoGPT | CrewAI | Database Design | JavaScript | LangchainPartial remoteEntry-level InternshipSaint-Denis, France1d ago
-
C++ | Calibration | Control | Data Structures | DebuggingAgile environment | Career development | Collaborative culture | Continuous learning | International teamMid-level Full TimeToulouse1d ago
-
Ingénieur / ingénieure Data – IA Générative EUR 60K-70KAWS | AWS Bedrock | Azure DevOps | CI/CD | Copilot StudioCareer development support | Diversity inclusion agreements | Employee share participation | Health insurance | Paid time offSenior-level Full TimeÉchirolles, Auvergne-Rhône-Alpes, France R1d ago
-
API Design | AWS | Agentic Workflows | Backend Development | Cloud platformSenior-level Full TimeParis1d ago
-
Data Engineer GCP (F/H) EUR 50K-55KAPI Development | Agile | BigQuery | CI/CD | Cloud BuildInternal training platform | Professional growth | Supportive management | TeleworkMid-level Full TimeParis, Île-de-France, France R1d ago
-
Data Engineer AWS (F/H) EUR 35K-55KAWS Athena | AWS Glue | AWS Lambda | Amazon S3 | Apache AirflowCooptation bonus | Employee benefits | English courses | Health insurance | Meal ticketSenior-level Full TimeLa Garenne-Colombes, Île-de-France, France R1d ago
-
Data Engineer AWS Senior (F/H) EUR 35K-55KAPI Gateway | AWS Lambda | Amazon API | Amazon API Gateway | Amazon CognitoCommunity events | Cooptation bonus | E-learning access | Employee benefits | English trainingSenior-level Full TimeLa Défense, Île-de-France, France R1d ago
-
Data Engineer Cloud (F/H) EUR 35K-55KAWS | AWS Lambda | AWS RDS | AWS Step Functions | Apache SparkCooptation bonus | Employee benefits | Health insurance | Meal vouchers | Mobility bonusSenior-level Full TimeLa Défense, Île-de-France, France R1d ago
-
C# | Combinatorics | Graph theory | MATLAB | NumPyPart-time project workSenior-level FreelanceFrance - Remote R1d ago
-
Machine Learning Engineer (Post-Training) EUR 57K-84KAWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference OptimizationSenior-level Full TimeParis, France1d ago
-
Algorithms | Antenna systems | MATLAB | Machine Learning | PythonCollaborative team | Company support | Inclusive work environment | Technical documentationEntry-level Apprenticeship Full TimeToulouse Champollion, France1d ago