Machine Learning Researcher - RL and Agentic Systems
Tasks
- Automate dataset validation environment generation rollout analysis and benchmark construction
- Benchmark model behavior in reinforcement learning and agentic settings
- Build quality scorecards and evaluation methods
- Build scalable evaluation and validation tooling
- Collaborate with research and engineering teams to improve evaluation methodology and best practices
- Connect model failures to dataset environment and task design gaps
- Design and build datasets tasks and environments
- Develop frameworks for evaluating real world data quality
- Evaluate planning tool use robustness recovery and generalization
- Improve infrastructure for reproducible experimentation and evaluation quality
- Translate real world workflows into structured tasks interaction traces trajectories and stateful environments
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic Systems | Benchmarking | Data Validation | Dataset Quality Evaluation | Dataset quality | Decision Making | Evaluation methodology | Experimental Design | Imitation Learning | Large Scale Data | Large-scale | Machine Learning | Offline Reinforcement Learning | Online Reinforcement Learning | Quality evaluation | RLAIF | RLHF | Reinforcement Learning | Reproducible experimentation | Reward Modeling | Sequential decision making | Simulation Environments | Task design
Education
Related jobs
-
Associate AI Engineer USD 144K-180K.NET | APIs | ASPNet | AWS | Azure401k matching | Dental insurance | Hybrid work model | Medical insurance | Paid time offMid-level Full TimeIrving, TX R11h ago
-
Data Scientist-Costa Rica USD 91K-111KCUDA | Convolutional Neural Networks | Data Transformation | Deep learning | Deep reinforcement learningBereavement | Dental coverage | Disability insurance | Employee assistance program | Employee discount programMid-level Full TimeCosta Rica R13h ago
-
Lead Fraud Data Scientist USD 164K-230KA/B | A/B Testing | AWS | Azure | B testingDental insurance | Health insurance | Hybrid work option | Performance bonus | Remote workSenior-level Full TimeNew York, San Francisco, Miami R14h ago
-
Data Engineer USD 115K-162KAWS | Apache Spark | Azure | Cloud platform | Data VisualizationRemote workMid-level Full TimeRemote R16h ago
-
AI Developer USD 140K-170KAPI Development | Algorithms | Amazon Web Services | Azure | CI/CDAdvanced English communication support | Remote positionMid-level Full TimeRemote R16h ago
-
Agile | Artificial Intelligence | Data Architecture | Data Engineering | Data GovernanceHybrid work environment | Mentorship | Training and development | Work from home daysEntry-level Full TimeMidrand, GP, South Africa R16h ago
-
Data Scientist GenAI USD 111K-123KApache Spark | Cloud Computing | Data Analysis | Data Processing | Data cleaningComprehensive healthcare | Fully remote | International projects | Long-term contract | Multinational environmentMid-level Full TimeRemote R17h ago
-
IT Data Engineer USD 60K-65KANSI X12 | AWS | Azure | Azure Data | Azure Data Factory401k retirement plan | Disability coverage | Employee assistance program | Flexible spending account | Medical/Dental/Vision insuranceMid-level Full TimeUS - Remote R17h ago
-
Cloud Engineer USD 128K-298KAWS | Active Directory | Azure DevOps | Azure Machine Learning | Azure OpenAISenior-level Full TimeFlexible Hybrid R17h ago
-
Senior Specialist - Data Science USD 85K-150KAWS | Backtesting | Business Intelligence | Code Reviews | Data AnalysisSenior-level Full Time2911 Lake Vista Drive, TX, 500 … R17h ago
-
Sr. AI Engineer USD 150K-175KAccess Control | Agentic AI | Auditability | CI/CD | Cloud platform401k | Dental insurance | Expense reimbursement for internet costs | Life insurance | Medical insuranceSenior-level Full TimeRemote, USA, United States R18h ago
-
Data Scientist H/F EUR 15K-21KAWS Bedrock | Agentic AI | Continuous Deployment | Continuous integration | DataikuEntry-level InternshipEurope, France, Ile-de-France, 92 - Hauts-De-Seine R19h ago
-
Senior Data Engineer GBP 72K-80KAWS | AWS Glue | AWS Lambda | Amazon Athena | Amazon KinesisCareer progression | Flexible work-life balance | Inclusive culture | Open holiday policy | Professional developmentSenior-level Full TimeHanoi, Vietnam R20h ago
-
Manager , Data and Analytics INR 800K-2000KAWS Redshift | Agile | Azure | BigQuery | Cloud analyticsMid-level Full TimeBangalore, Karnataka, India R21h ago
-
Senior Marketing Analytics Engineer - Remote AUD 110K-120KBusiness Intelligence | Data Catalog | Data Modeling | Data Visualization | ForecastingEmployee stock options | Learning and development | Parental leave | Remote work | WFH office expense budgetSenior-level Full TimeSydney, New South Wales 2000, Australia R1d ago
-
Analyst, Data Analytics INR 1400K-2500KAI-enabled | AI-enabled analytics | Advanced Analytics | Analytics engineering | AzureComprehensive healthcare | Flexible time off | Hybrid work model | Retirement plan | Support for working parentsMid-level Full TimeMU8-South (A) Wing, 7-10 Floor, Nesco … R1d ago
-
MLOps Engineer GBP 76K-100KAirflow | Automated retraining | Autoscaling | BigQuery | Blue-Green DeploymentDiscounts | Enhanced parental leave | Holiday allowance | Learning and development programme | Life insuranceSenior-level Full TimeUnited Kingdom - Remote R1d ago
-
Senior Manager, Data Scientist (AI/ML & GenAI) INR 2500K-5000KAI Services | API Development | Agentic AI | Azure AI | Azure AI ServicesSenior-level Full TimeIN - Hyderabad, India R1d ago
-
Edge AI Engineer USD 100K-150KBenchmarking | Bias Evaluation | C++ | Core ML | Data PrivacyCareer growth | Health benefits | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAgent systems | Computer Vision | Data Quality | Data quality monitoring | Deep learningMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapter methods | Attention Optimization | Benchmarking | Dataset Quality Assurance | Dataset curationCareer growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Engineer, Global Operations USD 107K-143KArtificial Intelligence | ETL | Langchain | Machine Learning | PythonContinuous learning | Flexible work options | Health, wellness, and retirement plans | Professional growth opportunitiesMid-level Full TimeCalifornia - Remote, United States R1d ago
-
Senior Machine Learning Engineer EUR 80K-120KAI Assisted Development | API Development | Agent architectures | Apache Airflow | DBTSenior-level Full TimePortugal - All - Fully Flexible R1d ago
-
Consultant - Quantitative Risk Consulting EUR 28K-28KAnomaly Detection | Artificial Intelligence | Basel | Business Intelligence | CosoApprenticeship contract | Flexible benefits | Health assistance | Hybrid work | Insurance coverageMid-level Full TimeRome - Villa Grazio, Italy R1d ago
-
Machine Learning Lead Analyst - HIH - Evernorth INR 2500K-3500KAWS | Authentication | Azure | C# | C++Cross-functional collaboration | Professional growth | Remote work flexibilitySenior-level Full TimeHIH - Hyderabad, India R1d ago