aijobs.net

Research Scientist, LLM Evaluation & Post-Training

Remote Work (USA), United States

USD 150K-300K Mid-level Full Time

Found 1d ago
Skills/Tech-stack

AI Feedback | Alignment | Benchmarking | Context evaluation | Deep learning | Direct Preference Optimization | Error Analysis | Experimental Design | Fine Tuning | Group Relative Policy Optimization | Hugging Face | Human Feedback | Human evaluation | Inter-rater reliability | JAX | LLM Evaluation | Learning from Human Feedback | Long Context | Long Context Evaluation | Machine Learning | Multimodal evaluation | Policy Optimization | Preference optimization | Proximal Policy Optimization | PyTorch | Python | RAG | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Robustness Testing | Safety | Significance Testing | Statistical Analysis | Supervised Fine Tuning | TensorFlow | Uncertainty Quantification | Vector Databases

Education

Master of Science | PhD

Roles

Machine Learning Researcher | Research Scientist | Researcher | Scientist

Regions

North America

Countries

United States
