aijobs.net

Principal Research Engineer, Model Training & Post-Training

Palo Alto, California, United States

USD 400K-550K Senior-level Full Time

Apply Save
Found 19h ago
Tasks
Perks/Benefits
Skills/Tech-stack

AI Feedback | Checkpointing | Cost Performance | Cost-performance tradeoffs | Data Decontamination | Data Deduplication | Deep learning | Direct Preference Optimization | Distillation | Distributed Training | Fault Tolerance | Fine Tuning | GPU clusters | Group Relative Policy Optimization | Human Feedback | Human-in-the-loop | Observability | Performance tradeoffs | Policy Optimization | Preference optimization | Regression Detection | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning with Human Feedback | Reproducibility | Reward Modeling | Supervised Fine Tuning | Synthetic data | The Loop | Throughput Optimization | Tool Use Fine Tuning | Tool use | Transformer

Education

PhD

Roles

Engineer | Principal | Principal Research Engineer | Research Engineer

Regions

North America

Countries

United States

States

California, US

Cities

Palo Alto, California, US

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs