86 jobs for Reinforcement Learning from Human Feedback

Foundation Model Engineer USD 100K-150K

Adapter Layers | DPO | Dataset curation | DeepSpeed ZeRO | Distributed Training

Senior-level Full Time

United States - Remote R

1d ago

Applied Machine Learning Researcher A AUD 130K-180K

DPO | Data Curation | Data labeling | Evaluation Frameworks | Experimentation

Fully remote | High ownership role | In-person gatherings | Opportunities for growth | Remote flexibility

Senior-level Full Time

Australia R

4d ago

顶尖应届-影像大模型算法研究员-相机 CNY 25K-37K

Agent systems | Bokeh | C# | C++ | CMake

Entry-level Full Time

上海

4d ago

Generative AI Engineer USD 100K-150K

Adapter-Tuning | Benchmarking | DPO | Direct Preference Optimization | Distributed Training

Senior-level Full Time

United States - Remote R

4d ago

LLM Engineer USD 100K-150K

Adapters | DPO | Dataset curation | Deep reinforcement learning | Efficient Attention

Senior-level Full Time

United States - Remote R

4d ago

Applied Scientist, Sponsored Products Off-Search Auction USD 142K-223K

A/B | A/B Testing | Ad Auctions | Auction theory | B testing

Mid-level Full Time

Seattle, Washington, USA

4d ago

Senior Solutions Engineer, AI Data & Model Evaluation Solutions A CAD 152K-188K

Artificial Intelligence | Benchmarking | Data Annotation | Data Operations | Human Feedback

Cross-functional collaboration | Exposure to cutting-edge AI projects | Fully remote work | High-impact role | Professional development opportunities

Senior-level Full Time

Canada R

5d ago

Engineering Manager, Machine Learning & NLP, Input Experience USD 175K-285K

Data Engineering | Data Synthesis | Dataset curation | Efficient Fine Tuning | Experimentation

Mid-level Full Time

Cupertino

5d ago

Senior Quantitative Finance Subject Matter Expert (AI Evaluation) | U.S. USD 110K-120K

AI Quality Assurance | AI annotation | AI quality | Benchmark Dataset Development | Bloomberg terminal

Project-based engagements | Remote work

Senior-level Full Time

United States - Remote R

5d ago

Senior Quantitative Finance SME (AI Evaluation) | Pakistan/India USD 30K-38K

AI Quality Assurance | AI quality | Benchmark Dataset Development | Bloomberg terminal | Capital IQ

Flexible hours | Project-based assignments | Remote work

Senior-level Full Time

Pakistan - Remote R

5d ago

Foundation Model Engineer USD 100K-150K

Adapter methods | Attention Mechanisms | Dataset curation | Direct Preference Optimization | Distributed Training

Senior-level Full Time

United States - Remote R

5d ago

Machine Learning Manager, Search & Knowledge Platforms USD 175K-301K

AI Feedback | Hallucination reduction | High Availability | Human Feedback | Language Models

Mid-level Full Time

Santa Clara; Seattle

5d ago

Foundation AI Engineer (LLM) CAD 100K-110K

AI Feedback | Attention Mechanisms | Constitutional AI | Constitutional Safety Tuning | Data Curation

Annual health checkups | Healthcare insurance | Opportunity to collaborate with industry professionals | Performance bonuses | Preferential pricing for services

Mid-level Full Time

Hanoi, Vietnam

5d ago

Applied Scientist, Sponsored Products and Brands USD 142K-223K

A/B | A/B Testing | Ads Ranking | B testing | C++

Mid-level Full Time

Seattle, Washington, USA

6d ago

Senior Machine Learning Engineer, Agent Oversight USD 216K-270K

Agent Orchestration | Anomaly Detection | Drift Detection | Evaluation Methodologies | Experimentation

Dental insurance | Health insurance | Learning and development stipend | Paid time off | Retirement benefits

Senior-level Full Time

San Francisco, CA; New York, NY

6d ago

AIML - Applied Research Engineer, Machine Translation EUR 90K-103K

Amazon Web Services | C++ | Cloud platform | Dask | Data Generation

Senior-level Full Time

Aachen

6d ago

Applied Scientist II / Senior Applied Scientist - Responsible AI (CoreAI) USD 102K-261K

API Design | C# | Data Processing | Debugging | Deep learning

Mid-level Full Time

United States, Washington, Redmond; United States, …

10d ago

多模态大模型算法工程师 CNY 240K-480K

Agile | Computer Vision | Data Annotation | Data Deduplication | Data Filtering

Senior-level Full Time

上海

12d ago

多模态大模型算法工程师 CNY 240K-480K

Agile | Attribution Analysis | Automatic Labeling | Computer Vision | Data Ablation

Senior-level Full Time

成都

12d ago

IN_Senior Associate_Data Science + Gen AI_ GCC_Advisory_Hyderabad INR 1800K-5000K

API Integration | Anthropic | Embeddings | Faiss | Hugging Face

Senior-level Full Time

Hyderabad - Salarpuria, India

12d ago

Staff Data Scientist CAD 127K-203K

A/B | A/B Testing | AWS | AWS Glue | Amazon Athena

Health benefits | Hybrid work model | Remote eligible

Senior-level Full Time

Canada R

12d ago

Staff Data Scientist USD 133K-272K

A/B | A/B Testing | AWS | Athena | B testing

Hybrid work

Senior-level Full Time

Remote, USA R

12d ago

Applied Reinforcement Learning Engineer USD 150K-300K

A2C | A3C | Actor-critic | Agent systems | BCQ

Collaborate with industry leaders | Equal opportunity employer | Hybrid remote work | Research publications support

Mid-level Full Time

Remote Work( USA), United States R

13d ago

Research Engineer BRL 200K-240K

Ablation Studies | Computer Vision | Data Decontamination | Data Deduplication | Data Generation

Autonomy | Rapid iteration | Remote work flexibility | Talent dense team

Mid-level Full Time

Brazil

13d ago

Senior Data Scientist - Applied AI EUR 45K-45K

Agentic AI | Autogen | Big Data | Cloud Platforms | Data Pipelines

Senior-level Full Time

Bratislava, Bratislavsky kraj, Slovakia

13d ago

GenAI Engineer - Product Marketing & Adoption ZAR 456K-540K

Adapter-Tuning | Apache Spark | Function Calling | Generative AI | Google Cloud

Casual work environment | Flexible work hours | Internet allowance | Medical aid | Provident fund

Entry-level Full Time

Capetown

13d ago

AI Integration Engineer USD 125K-188K

AWS | AWS Lambda | Algorithms | Amazon S3 | Artificial Intelligence

401k matching | Commuter benefits | Dental insurance | Free snacks | Health insurance

Mid-level Full Time

Washington D.C.

14d ago

Staff Machine Learning Engineer - AI Cloud USD 198K-330K

A/B | A/B Testing | Active Learning | Agent Orchestration | Automated retraining

Senior-level Full Time

San Francisco, CA; New York, NY; …

14d ago

AI Researcher

Big Data | Chain-of-Thought | Clustering | Cost Optimization | Data Classification

Mid-level Full Time

Tel Aviv-Yafo, Tel Aviv District, IL

14d ago

Algorithm Engineer, TikTok E-Commerce (Conversational AI) USD 136K-252K

A/B | A/B Testing | B testing | Continual Learning | Data Set

Senior-level Full Time

Seattle, Washington, United States

14d ago

Algorithm Engineer, TikTok E-Commerce (Conversational AI) USD 136K-280K

A/B | A/B Testing | B testing | Continual Learning | Dataset Construction

Senior-level Full Time

San Jose, California, United States

14d ago

Research Scientist, Agent Post Training, DeepMind USD 174K-253K

Data Analysis | Distributed Training | Experimentation | Flax | Human Feedback

Mid-level Full Time

Mountain View, CA, USA

17d ago

Senior Applied Scientist, Amazon Brand Stores USD 167K-248K

A/B | A/B Testing | B testing | C++ | Data Analysis

401k matching | Dental insurance | Employee assistance program | Health insurance | Health support programs

Senior-level Full Time

Seattle, Washington, USA

19d ago

Sr Software Dev Engineer, Stores Foundational AI -SFAI USD 168K-227K

Async Rollouts | Batching | Distributed Systems | Experiment tracking | GPU Utilization

401k matching | Dental insurance | EAP | Flexible spending accounts | Health insurance

Senior-level Full Time

Seattle, Washington, USA

22d ago

Agent Post-Training, Frontier Evals and Environments Research USD 295K-445K

AI Feedback | Data Pipelines | Evaluation | Experiment design | Grading

Mid-level Full Time

San Francisco

24d ago

Agent Post-Training, API & Power Users USD 295K-445K

AI Feedback | Agent systems | Computer use | Cost Optimization | Data Generation

Senior-level Full Time

San Francisco

24d ago

Agent Post-Training Research USD 295K-445K

AI Feedback | Agent systems | Calibrated Reasoning | Data Pipelines | Deep learning

Mid-level Full Time

San Francisco

26d ago

Junior Foundation AI Engineer EUR 30K

AWS | Accelerate | Azure | CUDA | Cloud Computing

Corporate welfare | Health insurance | Meal vouchers | Smart working | Training

Entry-level Full Time

Milano (Bassi), Italy

26d ago

Machine Learning Engineer - Reinforcement Learning EUR 54K-75K

Data Pipelines | Evaluation | Fine Tuning | Human Feedback | LLM Fine-tuning

Senior-level Full Time

Paris, France

26d ago

FY26 Intern - Deep Learning Researcher Internship - 4 months, Amsterdam EUR 25K-25K

Artificial Intelligence | Autoregressive modeling | Computer Vision | Conditional Computation | Deep learning

Flexible start dates | Holiday pay | On site work in Amsterdam | Relocation assistance | Sick pay

Entry-level Internship

Amsterdam, North Holland, Netherlands

26d ago

Staff Software Engineer, AI/ML USD 216K-271K

AI Feedback | Agentic AI | Data Pipelines | Direct Preference Optimization | Experimentation platforms

Conference reimbursement | Education reimbursement | Employee assistance program | Employee stock purchase program | Equity compensation

Senior-level Full Time

Seattle

27d ago

Senior Machine Learning Engineer, Computer Vision/VLM USD 204K-259K

AI Feedback | Computer Vision | Data Processing | Data Processing Pipelines | Deep learning

Senior-level Full Time

Mountain View, CA, USA; San Francisco, …

27d ago

Senior Solutions Architect, Generative AI Research USD 184K-287K

AI Agents | AI Feedback | Agent evaluation | Artificial Intelligence | Batching

Senior-level Full Time

US, FL, Remote, United States R

27d ago

Senior Applied Scientist USD 142K-270K

Data Pipelines | Diffusion Models | Direct Preference Optimization | Evaluation metrics | Fine Tuning

Senior-level Full Time

Seattle, United States R

27d ago

Director, Reinforcement Learning & Agentic Post-Training EUR 151K-200K

AI Feedback | API Integration | Distributed Training | Environment Design | Evaluation

Executive-level Full Time

Paris, France

27d ago

Staff Machine Learning Engineer GBP 155K-163K

Data Processing | Deep learning | Distributed Training | Generative AI | Human Feedback

Company benefits program | Discretionary annual bonus | Equity incentive plan

Senior-level Full Time

London, UK

28d ago

Senior AI Solution Architect

Agentic Workflows | Amazon AgentCore | Amazon Bedrock | Amazon Kendra | Artificial Intelligence

Senior-level Full Time

Seoul, KOR

29d ago

Senior Applied Scientist, Alexa AI USD 167K-227K

Agentic Architectures | Automated Training | Automated training pipelines | C++ | DPO

Senior-level Full Time

Turin, Piedmont, ITA

29d ago

Machine Learning Engineer (Generative AI) USD 131K-180K

Data Preprocessing | Deep learning | Generative AI | Human Feedback | Language Models

Collaborative work culture | Health and wellness programs | Mentorship | Relocation assistance

None Full Time

Santa Clara,CA, United States

29d ago

Senior Applied Scientist, Sponsored Products and Brands USD 167K-248K

A/B | A/B Testing | B testing | C++ | Data Analysis

401k matching | Health and wellness benefits | Health insurance | Paid time off | Parental leave

Senior-level Full Time

New York, New York, USA

1mo ago

Find jobs in AI/ML, Data Science and Big Data