Machine Learning Researcher - RL and Agentic Systems
Tasks
- Automate dataset validation environment generation rollout analysis and benchmark construction
- Benchmark model behavior in reinforcement learning and agentic settings
- Build quality scorecards and evaluation methods
- Build scalable evaluation and validation tooling
- Collaborate with research and engineering teams to improve evaluation methodology and best practices
- Connect model failures to dataset environment and task design gaps
- Design and build datasets tasks and environments
- Develop frameworks for evaluating real world data quality
- Evaluate planning tool use robustness recovery and generalization
- Improve infrastructure for reproducible experimentation and evaluation quality
- Translate real world workflows into structured tasks interaction traces trajectories and stateful environments
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic Systems | Benchmarking | Data Validation | Dataset Quality Evaluation | Dataset quality | Decision Making | Evaluation methodology | Experimental Design | Imitation Learning | Large Scale Data | Large-scale | Machine Learning | Offline Reinforcement Learning | Online Reinforcement Learning | Quality evaluation | RLAIF | RLHF | Reinforcement Learning | Reproducible experimentation | Reward Modeling | Sequential decision making | Simulation Environments | Task design
Education
Related jobs
-
Principal Applied AI Scientist - Agentic AI USD 190K-210KAgentic Systems | Autogen | Cloud Computing | Data Quality | Data Systems401k match | Flexible schedule | Health insurance | Paid parental leave | Paid time offSenior-level Full TimeWork From Home, United States R7h ago
-
Principal Applied AI Scientist - Predictive AI USD 190K-210KAnomaly Detection | Big Data | Cloud Computing | Data Quality | Data quality assurance401k match | Disability insurance | Employee assistance program | Flexible schedules | Health insuranceSenior-level Full TimeWork From Home, United States R7h ago
-
Principal Applied AI Scientist - Agentic AI USD 190K-210KAccuracy testing | Agentic Systems | Autogen | Cloud Computing | CrewAI401k match | Dental insurance | Employee assistance program | Employee stock purchase plan | Flexible schedulingSenior-level Full TimeWork From Home, United States R7h ago
-
Senior-level Full TimeRemote R7h ago
-
Machine Learning Engineer USD 131K-178KAWS | Cassandra | Convolutional Neural Networks | Data Lakes | Data PipelinesMid-level Full TimeRemote, NY, US R9h ago
-
Senior Data Engineer USD 132K-167KAWS | DBT | Data Modeling | Data Pipeline Monitoring | Data QualityRemote workSenior-level Full TimeRemote R10h ago
-
Software Engineer, Machine Learning USD 213K-293KAPI Design | Agent Orchestration | Artificial Intelligence | Bias Mitigation | C++Senior-level Full TimeSunnyvale, CA | Remote, US | … R12h ago
-
Continuous Delivery | Deep learning | Integration Testing | Java | JavaScriptAnnual offsite | Coworking stipend | Learning time | Monthly in person rituals | Remote workSenior-level Full TimeFrance, France R15h ago
-
AWS | DBT | Data Modeling | Data Pipelines | Data QualityCollaborative environment | End to end data engineering ownership | Engineering quality focus | Fully remote | Latin America location requirementSenior-level Full TimeBrazil R18h ago
-
Senior Data Scientist USD 180K-220KMLOps | Machine Learning | Model Monitoring | NumPy | PandasDental insurance | Flexible vacation | Flexible work hours | Health insurance | Parental leaveSenior-level Full TimeRemote, North America R21h ago
-
Data Scientist USD 150K-180KData Analysis | Drift Detection | GitHub | Machine Learning | Model DeploymentDental insurance | Equipment provided | Flexible vacation | Flexible work hours | Health insuranceMid-level Full TimeRemote, North America R21h ago
-
Staff Machine Learning Engineer USD 189K-389KCalibration | Contextual Bandits | Contextual Decisioning | Data Validation | EmbeddingsEquity eligible | In Office 1 Day Per WeekSenior-level Full TimeSan Francisco, CA, US; Remote, US R22h ago
-
Senior Staff Data Scientist - Consumer Relevance USD 232K-325KCausal Inference | Counterfactual evaluation | Experimental Design | Offline evaluation | Power analysis401k employer match | Caregiving support | Comprehensive healthcare | Family planning support | Flexible vacationSenior-level Full TimeRemote - United States R23h ago
-
Senior-level Full TimeUS - Remote R23h ago
-
Big Data | Feature Engineering | LLM | Langchain | MLOpsPart-time project workMid-level FreelanceSpain - Remote R23h ago
-
Data Analysis | Data Modeling | Data cleaning | Exploratory Data Analysis | Feature EngineeringFlexible part-time schedule | Freelance project-based work | Paid per projectMid-level FreelanceIreland - Remote R23h ago
-
Senior-level Full TimeIndia - Remote R23h ago
-
Gen AI Engineer EUR 64K-80KAI Search | AI orchestration | Agent Framework | Agent-based | Agent-based architectureContinuous learning | Flexible working | Personal equipment | Private health insurance | Professional developmentSenior-level Full TimeAthens, Attica, Greece - Remote R23h ago
-
Applied AI SME GBP 47K-52KAI Governance | Artificial Intelligence | Curriculum Development | Data Engineering | EthicsCollaborative people centered culture | Flexible remote working | Opportunities for growth | Remote work arrangementsMid-level Full TimeLondon, England, United Kingdom - Remote R23h ago
-
Senior Data Scientist — Applied Analytics (Data & AI) USD 118K-195KClustering | DBT | Data Modeling | Data Quality | Data pipeline401k employer match | Paid parental leave | Paid time off and holidays | Tuition reimbursementSenior-level Full TimeRaleigh, United States R23h ago
-
Senior AI/ML Engineer CHF 128K-176KAndroid | C plus plus | C# | Caffe | Computer VisionComprehensive benefits package | Hybrid work model | Work from home optionSenior-level Full TimeLausanne, Switzerland R23h ago
-
Principal AI/ML Engineer USD 165K-226KC# | C++ | CI/CD | CUDA | Computer Vision401k match | Dental insurance | Health insurance | Life insurance | Paid time offSenior-level Full TimeRemote PA - PA PAR, United … R23h ago
-
CAD | CUDA | Co-simulation | Contact mechanics | Controller co simulationSenior-level Full TimeSwitzerland, Remote R23h ago
-
Lead AI Engineer EUR 67K-93KAI | CI/CD | Docker | ETL | KubernetesFlexible work environment | Hybrid work 3 days per week on site | Professional development opportunities | Supportive leadersSenior-level Full TimeDublin - Charlotte, Ireland R23h ago
-
Senior AI Engineer EUR 58K-86KArtificial Intelligence | CI/CD | Docker | ETL | KubernetesHealth insurance | Hybrid work | Professional development opportunities | Supportive leaders | Workplace wellnessSenior-level Full TimeDublin - Charlotte, Ireland R23h ago