AI Research Engineer - Reinforcement Learning
Tasks
- Build RL experiments
- Create simulation environments
- Define success metrics
- Develop reinforcement learning algorithms
- Document experimental findings
- Evaluate RL experiments
- Improve policy performance
- Integrate RL agents into production
- Monitor deployed RL systems
- Optimize RL pipelines
- Resolve exploration strategy issues
- Resolve policy divergence issues
Perks/Benefits
- Career growth opportunities
- Flexible work culture
- Fully remote
- Global collaboration opportunities
- Innovation-focused work culture
Skills/Tech-stack
Actor-critic | Convergence analysis | Deep learning | Exploration/exploitation | Language Processing | Machine Learning | Model Optimization | Natural Language | Natural Language Processing | Policy Optimization | Policy gradients | PyTorch | Reinforcement Learning | Reward engineering | Sample efficiency
Education
Related jobs
-
APIs | Anomaly Detection | Data Modeling | Data Pipelines | Docker100 percent remote work | Autonomous work environment | Career growth | Flexible work environment | International team cultureMid-level Full TimeHungary R1d ago
-
Accelerate | Deep learning | Diffusers | EEG | FMRIAutonomy and ownership | Career growth | Continuous learning | Flexible, distributed work environment | Fully remote workMid-level Full TimeHungary R2d ago
-
AWS | AWS Bedrock | AWS SageMaker | Cloud Platforms | Computer VisionFlexible scheduling | Fully remote | International team culture | Paid training opportunities | Professional development supportMid-level Full TimeHungary R6d ago
-
SpiceUp: The Data Engineering Academy by DATAPAO HUF 10800K-10800KAWS | Apache Spark | Azure | Cloud Migration | DatabricksEAP access | Employee assistance program | Flexible PTO | Hybrid work | Learning accessEntry-level Full TimeBudapest R8d ago
-
Data Engineer, Associate HUF 9178K-14294KAutomated testing | DBT | Data Modeling | ELT | ETLEducation reimbursement | Family support | Flexible time off | Health support | Hybrid work modelMid-level Full TimeBU3-Budapest-GTC White House, Vaci ut 47, … R20d ago
-
AI Engineering Team Lead HUF 11840K-17760KCloud Architecture | Data Pipelines | Deep learning | GenAI | LLMFlexible work hours | Hybrid work | Remote work | Well-being supportSenior-level Full TimeHungary Remote R21d ago
-
AI Inference | AWS | Apache Iceberg | Artificial Intelligence | AzureContinued Career Development | Employee resource groups | Flexible WFH | Generous PTO | Paid volunteer timeSenior-level Full TimeHungary-Remote R22d ago
-
Machine Learning Engineer HUF 8202K-10600KAWS | Amazon SageMaker | Apache Spark | Data Preprocessing | GCPBreakfast fruits and lunch | Career development | Company mobile phone | Fitness-wellness allowance | Hybrid workMid-level Full TimeBudapest R29d ago
-
Senior Modelling Data Scientist, Vice President HUF 10627K-17818KAgentic Workflows | Azure | Benchmarking | CI/CD | Data ModelingEducation reimbursement | Family support programs | Flexible time off | Health benefits | Hybrid work modelSenior-level Full TimeBU3-Budapest-GTC White House, Vaci ut 47, … R29d ago
-
Staff Machine Learning Engineer HUF 10627K-17818KComputer Vision | Deep learning | Language Processing | Machine Learning | Machine Learning SystemCompany paid sick time | Flexible hours | Hybrid work options | Medical benefits after 90 days | Paid parental leaveSenior-level Full TimeBudapest, Hungary (Hybrid) R1mo ago
-
AI Agents | AI Inference | AWS | Apache Iceberg | Artificial IntelligenceCareer development | Employee resource groups | Flexible work from home | Generous PTO | Paid volunteer timeSenior-level Full TimeHungary-Budapest R1mo ago
-
A/B | A/B Testing | Agent Frameworks | Android Automotive | Android Automotive OSMid-level Full TimeHybrid, Budapest, Hungary, Karlsruhe, Germany R1mo ago