27 jobs for Policy Optimization

Embodied AI Algorithm Engineer - Model CNY 500K-500K

Actor-critic | Deep learning | Distributed Training | GPU Training | IQL

Mid-level Full Time

Beijing R

21h ago

Helix AI Engineer, Reinforcement Learning USD 150K-350K

Credit Assignment | Distributed Training | Experiment Management | Exploration | Model-based reinforcement learning

In-office collaboration

Senior-level Full Time

San Jose, CA

23h ago

AI Engineer - Reinforcement Learning EUR 60K-84K

Data Pipelines | Evaluation Frameworks | Fine Tuning | Human-in-the-loop | Language model fine-tuning

Senior-level Full Time

Paris, France

1d ago

Staff Software Engineer, Generative AI, Core ML USD 207K-300K

AI Feedback | Computer Vision | Data Processing | Deep learning | Digital Twin

Senior-level Full Time

Mountain View, CA, USA

2d ago

Machine Learning Engineer (Post-Training) EUR 57K-84K

AWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference Optimization

Senior-level Full Time

Paris, France

2d ago

Data Scientist (Reinforcement Learning) Intern SGD 42K-57K

AMQP | GitHub | Gym | HTTP | MQTT

Entry-level Full Time Internship

Singapore

3d ago

Decision Intelligence Engineer - Next Best Action USD 129K-177K

A3C | Backtesting | Bellman Equation | Conservative Q Learning | Constraint Mapping

401k retirement savings plan | Medical, dental, and vision benefits | Occasional travel | Remote work | Time off

Senior-level Full Time

Remote US, United States R

7d ago

Embodied AI - Multimodal Reinforcement Learning Algorithm Expert CNY 240K-480K

Actor-critic | Deep Q-Network | Isaac Sim | LLM | MuJoCo

Senior-level Full Time

Beijing, Shanghai

7d ago

LLM Engineer (Reinforcement Learning)

DDP | Deep learning | Direct Preference Optimization | Distributed Training | Docker

Senior-level Full Time

Pangyo (Software Dream Center), South Korea

9d ago

Data Science Intern GBP 25K-25K

Agile | Constrained optimization | Continuous Delivery | Experiment tracking | Gymnasium

Annual bonus | Charitable Causes Initiatives | Health insurance | Pension | Retention Bank

Entry-level Internship

London, GB

9d ago

Large Model Application Algorithm Engineer/Expert CNY 240K-480K

C++ | Computer Vision | Deep learning | Direct Preference Optimization | Human Computer Dialogue

Senior-level Full Time

Shanghai, Beijing

9d ago

Senior Applied AI Manager USD 170K-234K

Agent systems | Agentic Systems | Curriculum learning | Data Deduplication | Data mixing

Senior-level Full Time

San Mateo, CA

10d ago

Tech Lead Manager - MLRE, ML Systems USD 264K-331K

CUDA | Distributed Systems | Flash Attention | GRPO | Human Feedback

Commuter stipend | Generous PTO | Health, dental and vision coverage | Learning and development stipend | Retirement benefits

Senior-level Full Time

San Francisco, CA; New York, NY

10d ago

Machine Learning Engineer, Open-Source Software - Paris/London EUR 60K-78K

Fine Tuning | Hugging Face | JAX | Language Processing | llama.cpp

Generous parental leave policy | Health insurance | Meal vouchers | Private pension plan | Sport allowance

Mid-level Full Time

Paris

12d ago

Agent RL Infra Engineer USD 224K-356K

AI Feedback | Active Learning | Cluster management | Continuous Learning | Data Curation

Senior-level Full Time

US, CA, Santa Clara, United States

12d ago

Sr Machine Learning Engineer USD 159K-236K

AWS | Alignment Tuning | Anomaly Detection | Azure | BERT

Financial security support | Flexible hybrid work model | Healthcare coverage | Mental health resources | Paid time off

Senior-level Full Time

USA - California - San Jose …

14d ago

AI Engineer - Reinforcement Learning (Senior) CHF 128K-192K

Artificial neural networks | Autonomy | C++ | Computer Vision | Deep learning

In-person collaboration

Senior-level Full Time

Zürich

15d ago

Applied Scientist - Agentic AI, Amazon Fulfillment Technology USD 142K-193K

Agent systems | DPO | Deep learning | Evaluation | Fine Tuning

Mid-level Full Time

Bellevue, Washington, USA

15d ago

Applied Reinforcement Learning Engineer USD 150K-160K

Actor-critic | Agent systems | BCQ | Behavioral cloning | CQL

Equal opportunity employer | Hybrid remote work | Research publications opportunity

Mid-level Full Time

Remote Work (USA), United States R

15d ago

Data Science Researcher ILS 341K-443K

AI | AI Safety | A/B | A/B Testing | AWS Bedrock

Career growth opportunities | Flexible schedule | Hybrid work model | Mentoring | Remote work flexibility

Senior-level Full Time

Israel - Raanana

16d ago

Senior Software Engineer, Managed AI - AI Model LifeCycle USD 172K-209K

Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback

401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions

Senior-level Full Time

San Francisco, CA - US

21d ago

Staff Software Engineer, Model LifeCycle USD 208K-253K

API Design | Checkpointing | Distributed Training | Failure recovery | Fine Tuning

401k match | Cell phone stipend | Commuter benefits | Dental insurance | Employer HSA contributions

Senior-level Full Time

San Francisco, CA - US

21d ago

Staff AI Engineer, Model Post-Training and Alignment USD 196K-268K

Benchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Group Relative Policy Optimization

Company events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowances

Senior-level Full Time

APAC

22d ago

Senior ML Engineer – Distributed RL & Post-Training Infrastructure A USD 204K-350K

Automated testing | Cryptography | Direct Preference Optimization | Distributed Systems | Docker

Senior-level Full Time

Remote R

23d ago

Researcher, Loss of Control USD 295K-445K

Algorithms | Data Structures | Deep learning | Evaluation | Fine Tuning

Senior-level Full Time

San Francisco

1mo ago

Senior AI Research Scientist (6240) USD 170K-270K

Adversarial Learning | Attention Networks | Dash | Data Preprocessing | Data Wrangling

Hybrid work schedule | Professional development programs | Travel for training and team building

Senior-level Full Time

San Jose, CA, US

1mo ago

Research Engineer / Machine Learning Engineer - B2B Applications USD 295K-445K

Algorithms | Data Structures | Deep learning | Distillation | Fine Tuning

Collaborative environment | Cutting-edge technology | Impactful work | Learning opportunities

Mid-level Full Time

San Francisco

1mo ago
