28 jobs for Preference optimization

Machine Learning Engineer, Chakra USD 120K-235K

Agentic AI | Benchmarking | Conversational AI | Data Pipelines | Deep learning

Mid-level Full Time

Hybrid in Santa Clara, CA R

23h ago

Senior Data Scientist - (Query Intelligence, Global Discovery) EUR 67K-80K

AWS | Agent Orchestration | Autogen | Autonomous Agents | Direct Preference Optimization

Bicycle subsidy | Corporate discounts | Corporate pension plan | Digital meal vouchers | Educational budget

Senior-level Full Time

Berlin, Germany

1d ago

Staff Software Engineer, Generative AI, Core ML USD 207K-300K

AI Feedback | Computer Vision | Data Processing | Deep learning | Digital Twin

Senior-level Full Time

Mountain View, CA, USA

2d ago

Data Scientist Associate INR 1068K-1496K

API Development | AWS | Agentic AI | Apache Spark | Azure

Mid-level Full Time

Bengaluru, Karnataka, India

2d ago

Machine Learning Engineer (Post-Training) EUR 57K-84K

AWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference Optimization

Senior-level Full Time

Paris, France

2d ago

Senior Product Manager, LLM. SGD 132K-156K

AI infrastructure | Cost Optimization | Cost Performance | Cost-performance optimization | Data Quality

Senior-level Full Time

Crimson House Singapore

2d ago

AI Agents Applied Engineer - Senior Associate USD 148K-240K

A/B | A/B Testing | Auditability | B testing | Bandit Algorithms

Backup childcare | Financial coaching | Flexible benefits | Health care coverage | Mental health support

Senior-level Full Time

Brooklyn, NY, United States

2d ago

Senior Applied Scientist USD 180K-230K

Direct Preference Optimization | Distributed Training | Human Feedback | LLM-as-a-Judge | Language Models

Senior-level Full Time

Palo Alto

3d ago

Distinguished, Software Engineer -AI/ML Engineer- Walmart Connect USD 169K-338K

AWS | Attribution Modeling | Azure | C++ | Chain-of-Thought

401k | Health benefits | Paid time off | Parental leave | Stock purchase

Senior-level Full Time

(USA) Crossman Service Building CA SUNNYVALE …

8d ago

Research Scientist (Seed-LLM) USD 244K-450K

Data Construction | Deep learning | Inference Optimization | Instruction Tuning | Language Models

Mid-level Full Time

San Jose, California, United States

9d ago

LLM Engineer (Reinforcement Learning)

DDP | Deep learning | Direct Preference Optimization | Distributed Training | Docker

Senior-level Full Time

Pangyo (Software Dream Center), South Korea

9d ago

AI Research Scientist Intern (PhD), Embodied AI USD 93K-180K

Action models | Deep learning | Diffusion Models | Fine Tuning | Imitation Learning

In office collaboration 5 days per week | Publication opportunities

Entry-level Internship

Milpitas, CA

9d ago

Senior Applied Scientist USD 142K-270K

Diffusion Models | Direct Preference Optimization | Fine Tuning | Human Feedback | Inference acceleration

Senior-level Full Time

Seattle, United States

9d ago

大模型应用算法工程师/专家 CNY 240K-480K

C++ | Computer Vision | Deep learning | Direct Preference Optimization | Human Computer Dialogue

Senior-level Full Time

上海、北京

10d ago

Senior Applied AI Manager USD 170K-234K

Agent systems | Agentic Systems | Curriculum learning | Data Deduplication | Data mixing

Senior-level Full Time

San Mateo, CA

10d ago

Agent RL Infra Engineer USD 224K-356K

AI Feedback | Active Learning | Cluster management | Continuous Learning | Data Curation

Senior-level Full Time

US, CA, Santa Clara, United States

12d ago

Applied Reinforcement Learning Engineer USD 150K-160K

Actor-critic | Agent systems | BCQ | Behavioral cloning | CQL

Equal opportunity employer | Hybrid remote work | Research publications opportunity

Mid-level Full Time

Remote Work( USA), United States R

15d ago

校招-Ai研究科学家-大语言模型/视觉语言模型算法与后训练（博士优先） CNY 500K-500K

Adapters | Direct Preference Optimization | Fine Tuning | Flax | Function design

None Full Time

上海

21d ago

Senior Staff Software Engineer, Model LifeCycle USD 237K-288K

API Design | CUDA | Checkpointing | DPO | DeepSpeed

401k matching | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions

Senior-level Full Time

Tel Aviv - IL

21d ago

Senior Software Engineer, Managed AI - AI Model LifeCycle USD 172K-209K

Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback

401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions

Senior-level Full Time

San Francisco, CA - US

21d ago

Staff Software Engineer, Model LifeCycle USD 208K-253K

API Design | Checkpointing | Distributed Training | Failure recovery | Fine Tuning

401k match | Cell phone stipend | Commuter benefits | Dental insurance | Employer HSA contributions

Senior-level Full Time

San Francisco, CA - US

22d ago

Senior Data Scientist - (Query Intelligence, Global Discovery) EUR 64K-85K

Agent Orchestration | Amazon Web Services | Auto Planning | Autogen | Direct Preference Optimization

Bicycle subsidy | Corporate discounts | Corporate pension plan | Digital meal vouchers | Educational budget

Senior-level Full Time

Berlin, Germany

22d ago

Research Scientist – LTX Model Quality

Benchmark design | Computer Vision | Deep learning | Direct Preference Optimization | Evaluation metrics

Car to go subscriptions | Free parking | Learning opportunities | On site bakery | On-site restaurants

Mid-level Full Time

Jerusalem

22d ago

Staff AI Engineer, Model Post-Training and Alignment USD 196K-268K

Benchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy Optimization

Company events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowances

Senior-level Full Time

APAC

22d ago

Applied Scientist 3 USD 120K-238K

Fine Tuning | Inference Optimization | Machine Learning | Model Compression | Model Distillation

Mid-level Full Time

San Jose, United States

22d ago

Senior ML Engineer – Distributed RL & Post-Training Infrastructure A USD 204K-350K

Automated testing | Cryptography | Direct Preference Optimization | Distributed Systems | Docker

Senior-level Full Time

Remote R

23d ago

Senior AI Research Scientist (6240) USD 170K-270K

Adversarial Learning | Attention Networks | Dash | Data Preprocessing | Data Wrangling

Hybrid work schedule | Professional development programs | Travel for training and team building

Senior-level Full Time

San Jose, CA, US

1mo ago

Applied Scientist II, Sponsored Products and Brands - Advertiser Guidance USD 142K-223K

AI Safety | Agent architectures | Agent-based | Agent-based AI | Data Curation

Mid-level Full Time

Seattle, Washington, USA

1mo ago

Find jobs in AI/ML, Data Science and Big Data