44 jobs for Reward Modeling

Data Science / ML Engineer (AcS) USD 69K-140K

API Design | Data Pipelines | Evaluation | Langchain | Langgraph

100 percent remote

Mid-level Full Time

Remote Latam R

1d ago

Helix AI Engineer, Reinforcement Learning USD 150K-350K

Credit Assignment | Distributed Training | Experiment Management | Exploration | Model-based reinforcement learning

In-office collaboration

Senior-level Full Time

San Jose, CA

1d ago

Helix AI Engineer, Pretraining USD 175K-400K

Computer Vision | Data Mixture Optimization | Deep learning | Distributed Training | Language Processing

Senior-level Full Time

San Jose, CA

1d ago

ML Engineer / Data Scientist A

APIs | Data Pipelines | Distributed Systems | Langchain | Langgraph

Mid-level Full Time

Medellín, Medellín, Antioquia, Colombia, Antioquia, Colombia

2d ago

AI Engineer - Reinforcement Learning EUR 60K-84K

Data Pipelines | Evaluation Frameworks | Fine Tuning | Human-in-the-loop | Language model fine-tuning

Senior-level Full Time

Paris, France

2d ago

Advisor - AI-Guided Optimization for Biologics USD 166K-244K

Active Learning | Antibody Design | Bayesian optimization | Diffusion Models | Distributed Training

Company 401K | Employee assistance program | Fitness benefits | Flexible spending accounts | Life insurance

Mid-level Full Time

US: San Diego CA Lilly Biotechnology …

2d ago

Staff Software Engineer, Generative AI, Core ML USD 207K-300K

AI Feedback | Computer Vision | Data Processing | Deep learning | Digital Twin

Senior-level Full Time

Mountain View, CA, USA

3d ago

Machine Learning Engineer (Post-Training) EUR 57K-84K

AWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference Optimization

Senior-level Full Time

Paris, France

3d ago

Machine Learning Engineer I USD 151K-189K

AWS | Azure | Classification | Cloud Computing | Code review

401k match | Equity | Flexible PTO | Learning stipend | Medical/Dental/Vision insurance

Mid-level Full Time

San Francisco, CA

4d ago

AI Research Scientist - Safety Alignment Team USD 213K-293K

Adversarial prompts | Automation | Computer Vision | DPO | Dataset curation

Senior-level Full Time

Menlo Park, CA

5d ago

Data Scientist（LLM）

C++ | Data acquisition | Deep learning | Fine Tuning | Language Models

Mid-level Full Time

Taiwan, Taipei R

6d ago

Machine Learning Engineer II USD 170K-212K

A/B | A/B Testing | AWS | Agile | Apache Beam

401k retirement plan | Health insurance | Meal allowance | Paid flexible holidays | Paid parental leave

Senior-level Full Time

New York, NY

8d ago

Senior Machine Learning Engineer - Multimodal Data EUR 70K-70K

AWS | Amazon S3 | Annotation tools | Batching | Contamination detection

Equity packages | Flexible leave options | Inclusive parental leave | Vibe and Thrive allowance

Senior-level Full Time

Vienna, Vienna, Austria

9d ago

Applied Scientist III, Alexa Sensitive Content Intelligence (ASCI) USD 167K-226K

Computer Vision | Deep learning | Generative AI | Language Models | Language Processing

Senior-level Full Time

Bellevue, Washington, USA

9d ago

Applied Scientist II, Alexa Sensitive Content Intelligence (ASCI) USD 142K-193K

Automated testing | Deep learning | Distributed Training | Evaluation systems | Information Retrieval

Mid-level Full Time

Bellevue, Washington, USA

9d ago

Senior Machine Learning Engineer (Spain) GBP 70K-100K

API Integration | Agile methodologies | Bias detection | Data Governance | Data Quality

Equal pay guaranteed | Flexible working hours | Hybrid work | International exposure | Multicultural environment

Senior-level Full Time

Cambourne, United Kingdom of Great Britain …

9d ago

Senior Applied Scientist USD 142K-270K

Diffusion Models | Direct Preference Optimization | Fine Tuning | Human Feedback | Inference acceleration

Senior-level Full Time

Seattle, United States

10d ago

Research Intern – Reinforcement Learning (RL) INR 300K-420K

Agent systems | Fine Tuning | LLM Fine-tuning | Language Processing | Learning environments

Entry-level Internship

Noida

11d ago

Applied Scientist II, Alexa Sensitive Content Intelligence (ASCI) USD 142K-193K

Automated Evaluation | Information Retrieval | Language Models | Language Processing | Large Language Models

Mid-level Full Time

Bellevue, Washington, USA

11d ago

AI Research - Scientist/ Engineer USD 245K-350K

Benchmarking | Evaluation Frameworks | Fine Tuning | Language Models | Language Processing

Periodic in-person meetings | Work from home

Mid-level Full Time

Global

12d ago

Staff Applied AI Scientist CNY 200K-500K

Benchmarking | Cost Optimization | DPO | Deep learning | Distillation

Cross-functional collaboration | Direct impact with real customer data | Remote-friendly work

Senior-level Full Time

Shenzhen, Guangdong Province, China

12d ago

Agent RL Infra Engineer USD 224K-356K

AI Feedback | Active Learning | Cluster management | Continuous Learning | Data Curation

Senior-level Full Time

US, CA, Santa Clara, United States

13d ago

Large Model Application Algorithm Research Scientist-International Content Security Algorithm Research

Chain-of-Thought | Data Compliance | Knowledge Distillation | Language Models | Language Processing

Entry-level Full Time

Singapore, Singapore

14d ago

Sr Machine Learning Engineer USD 159K-236K

AWS | Alignment Tuning | Anomaly Detection | Azure | BERT

Financial security support | Flexible hybrid work model | Healthcare coverage | Mental health resources | Paid time off

Senior-level Full Time

USA - California - San Jose …

15d ago

Applied Scientist - Agentic AI, Amazon Fulfillment Technology USD 142K-193K

Agent systems | DPO | Deep learning | Evaluation | Fine Tuning

Mid-level Full Time

Bellevue, Washington, USA

16d ago

Applied Reinforcement Learning Engineer USD 150K-160K

Actor-critic | Agent systems | BCQ | Behavioral cloning | CQL

Equal opportunity employer | Hybrid remote work | Research publications opportunity

Mid-level Full Time

Remote Work( USA), United States R

16d ago

AI Engineer (PhD Required) USD 300K-405K

Architecture Search | Attention Mechanisms | Autogen | Automl | Computer Vision

Autonomy in research | Opportunity to deploy research at scale | Remote work

Mid-level Full Time

Lahore, Punjab

18d ago

AI Engineer (PhD Required) SGD 96K-138K

Attention Mechanisms | Autogen | Chunking | Constitutional AI | Distributed Training

Annual team events | Casual team environment | Flexible hours | Internet reimbursement | Opportunity for advancement

Mid-level Full Time

Singapore, Singapore

18d ago

Senior AI Engineer USD 150K-291K

API Development | Content Moderation | Content Safety | Data pipeline | Distributed Systems

401k | Dental insurance | Flexible vacation policy | Flexible working hours | Health insurance

Senior-level Full Time

Los Altos

18d ago

Research Engineer BRL 200K-220K

Ablation Studies | Code review | Data Curation | Data Validation | Data denoising

5 days per week working | Collaborative team | Flexible working hours | Remote work | Supportive work environment

Mid-level Full Time

Brazil

18d ago

Research Engineer

Ablation Studies | Chart Reading | Code review | Continuous integration | Data Augmentation

Collaborative team | Flexible working hours | Remote work | Supportive work environment

Mid-level Full Time

Colombia, Huila, Colombia

18d ago

Principal Applied Scientist, Agentic AI USD 181K-305K

AI Feedback | DPO | Fine Tuning | Human Feedback | Learning from Human Feedback

Mentorship and technical leadership | Remote-first work environment

Senior-level Full Time

Remote-USA, United States R

19d ago

LLM Post-Training Engineer, Research & Product USD 212K-389K

Data Pipelines | Deep learning | Distributed Training | Human preference learning | Instruction Tuning

Senior-level Full Time

San Jose, California, United States

21d ago

Member of Technical Staff - Imagine Model USD 180K-440K

Audio Processing | C++ | Computer Vision | Data Annotation | Data Augmentation

401k | Dental insurance | Disability insurance | Employee discounts | Health insurance

Senior-level Full Time

Palo Alto, CA; Seattle, WA

21d ago

Senior Software Engineer, Managed AI - AI Model LifeCycle USD 172K-209K

Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback

401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions

Senior-level Full Time

San Francisco, CA - US

22d ago

Principal Engineer, AI Model LifeCycle USD 260K-326K

Adapters | Checkpointing | DPO | DeepSpeed | Distributed Training

Cell phone stipend | Commuter benefits | Dental insurance | Health insurance | Mental health wellness support

Senior-level Full Time

San Francisco, CA - US

22d ago

Research Scientist – LTX Model Quality

Audio generation | Benchmarking | Computer Vision | DPO | Deep learning

Mid-level Full Time

Haifa

23d ago

Research Scientist – LTX Model Quality

Benchmark design | Computer Vision | Deep learning | Direct Preference Optimization | Evaluation metrics

Car to go subscriptions | Free parking | Learning opportunities | On site bakery | On-site restaurants

Mid-level Full Time

Jerusalem

23d ago

Staff AI Engineer, Model Post-Training and Alignment USD 196K-268K

Benchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy Optimization

Company events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowances

Senior-level Full Time

APAC

23d ago

Research Scientist - LLM Foundation Models TWD 1200K-1500K

A Star | C plus plus | Data Augmentation | Deep learning | Fine Tuning

Entry-level Full Time

Taiwan, Taipei R

23d ago

Agent Framework Architect A USD 197K-291K

AI Platform | AI platform development | Agent Framework | Agent Orchestration | Agent framework architectures

Equity | Full benefits

Senior-level Full Time

Palo Alto, CA

30d ago

Applied Research - Forward-Deployed USD 280K-350K

Agent Frameworks | Ambiguity tolerance | Artifact Development | Communication skills | Customer Engagement

Competitive pay | Conference attendance | Development budget | Equity | Flexible work

Senior-level Full Time

San Francisco

1mo ago

Engineering Manager, Core Applications - AI for Member Systems USD 523K

API Design | Applied Machine Learning | Evaluation Pipelines | Fine Tuning | GenAI

Disability programs | Family-forming benefits | Flexible spending accounts | Health plans | Health savings accounts

Mid-level Full Time

USA - Remote, United States R

1mo ago

Staff Research Engineer, MetaAI Assistant Measurement USD 213K-293K

A/B | A/B Testing | AI interaction | AI systems | B testing

Senior-level Full Time

Bellevue, WA | New York, NY

1mo ago

Find jobs in AI/ML, Data Science and Big Data