AI Research Engineer - Reinforcement Learning
Tasks
- Build simulation environments
- Curate training datasets
- Debug reinforcement learning pipelines
- Define evaluation frameworks
- Design reinforcement learning algorithms
- Evaluate models against benchmarks
- Improve exploration strategy
- Integrate RL agents into production systems
- Monitor deployed systems
- Tune policy optimization methods
- Run controlled experiments
- Stabilize reward and training convergence
Skills/Tech-stack
Actor-critic | Audio processing | Benchmarking | Deep learning | Experiment design | Exploration/exploitation | Image processing | Multimodal learning | Online reinforcement learning | Policy optimization | Policy gradients | PyTorch | Reinforcement learning | Reward engineering | Sample efficiency | Simulation | Text processing | Training convergence | Training data