AI/ML Research Scientist, LLM Post-Training & Evaluation

Redmond, Washington, United States

USD 150K-160K Mid-level Full Time

@ C...


Found 1d ago

Tasks

Analyze model behavior and failure patterns
Collaborate with data scientists and engineers
Conduct human evaluation and rubric design
Create benchmark datasets and test suites
Define LLM evaluation research agenda
Design LLM post training experiments
Design scoring methods and evaluation protocols
Develop LLM evaluation frameworks
Engage with customer technical stakeholders
Compare human and automated evaluation methods
Perform robustness and stress testing
Produce technical reports and documentation
Publish research and open-source contributions
Study long context and multimodal evaluation
Translate research methods into evaluation pipelines

Related jobs

Lead Data Scientist USD 150K-175K

Classification | Cloud Computing | Clustering | Clustering Analysis | Computer Vision

401k matching | Dental and vision care | Employee assistance program | Employee discount program | Health and wellbeing benefits

Senior-level Full Time

Remote - Nationwide, United States

2h ago
Applied AI Scientist, Agentic AI USD 100K-120K

A2A protocol | Accuracy | Agentic AI | Autogen | CrewAI

401k match | Dental insurance | Employee assistance program | Flexible schedule | Health insurance

Mid-level Full Time

Work From Home, United States

2h ago
Staff AI Engineer USD 130K-185K

A/B Testing | ARIMA | Anomaly Detection | Apache Spark

Senior-level Full Time

Center, Center District, IL

4h ago
Data Scientist USD 154K-190K

A/B Testing | Anomaly Detection | Apache Spark

Senior-level Full Time

Center, Center District, IL

4h ago
AI Research Scientist, SysML - FAIR USD 143K-208K

Artificial Intelligence | C# | C++ | Co-design | Compiler design

Mid-level Full Time

Menlo Park, CA | Boston, MA …

8h ago
Research Scientist Intern, Multimodal Generative AI and Robotics (PhD) USD 91K-145K

3D machine perception | Benchmarking | Computer Vision | Deep learning | Generative Modeling

Entry-level Internship

Redmond, WA

8h ago
Research Scientist, AI & Systems Co-design (PhD) USD 117K-173K

C# | C++ | Communication optimization | Compiler optimization | Deep learning

Full Time

Menlo Park, CA

8h ago
Research Scientist Intern, Robotic Control Policy (PhD) USD 130K-204K

Control Theory | Dynamics | Imitation Learning | JAX | Kinematics

Entry-level Internship

Redmond, WA | Burlingame, CA

8h ago
AI Research Scientist, Media Data Research - MSL FAIR USD 117K-173K

Apache Spark | Computer Vision | Data Curation | Data Generation | Data Scaling Laws

Entry-level Full Time

Menlo Park, CA

8h ago
Research Scientist Intern, Applied Perception Science (PhD) USD 91K-145K

Bias Mitigation | Computational modeling | Computer Vision | Data Analysis | Data Set

Entry-level Internship

Redmond, WA

8h ago
AI Research Scientist - FAIR Social Intelligence USD 144K-251K

Artificial Intelligence | Computational statistics | Game theory | Machine Learning | Python

Entry-level Full Time

Bellevue, WA | Seattle, WA

8h ago
Research Scientist Intern, Machine Perception for Input and Interaction (PhD) USD 91K-145K

3D computer graphics | Action Recognition | Architecture Search | C++ | Cloud processing

Entry-level Internship

Redmond, WA | Seattle, WA

8h ago
Data Scientist, Products & Applied Research USD 173K-235K

Bias Mitigation | Causal Inference | Data Analysis | Data Mining | Experimentation

Career growth

Mid-level Full Time

Menlo Park, CA

8h ago
AI Research Scientist - Meta Superintelligence Labs (PhD) USD 170K-208K

Automatic Speech Recognition | Fine Tuning | Language Models | Language Processing | Large Language Models

Senior-level Full Time

Menlo Park, CA

8h ago
Data Scientist, Technical Lead - Infrastructure Data Center (IDC) USD 150K-211K

AI ethics | Agent Orchestration | Bias Mitigation | Capacity Planning | Data Storytelling

Senior-level Full Time

Bellevue, WA | Menlo Park, CA

8h ago
Postdoctoral Researcher, Fundamental AI Research (PhD) USD 117K-145K

Computational statistics | Computer Vision | Data Compression | Deep learning | Generative Modeling

Entry-level Full Time

Menlo Park, CA

8h ago
Data Scientist, Analytics (Technical Leadership) USD 160K-190K

AI Workflow Optimization | AI workflow | Agent Orchestration | Bias Mitigation | Causal Inference

Career development | World class analytics community

Senior-level Full Time

Remote, US | Bellevue, WA | …

8h ago
Research Scientist, Central Applied Science (PhD) USD 112K-173K

Agent Orchestration | Algorithm Development | Apache Hive | Apache Spark | Artificial Intelligence

Work authorization support

Entry-level Full Time

Menlo Park, CA | New York, …

8h ago
Data Scientist USD 203K-235K

A/B Testing | Data Mining | Python

Mid-level Full Time

Menlo Park, CA

8h ago
AI Research Scientist - MSL FAIR Foundations USD 147K-251K

Benchmarking | Deep learning | Evaluation methodology | Language Model | Language Model Evaluation

Mid-level Full Time

Menlo Park, CA

8h ago
Audio Algorithm Architect, Applied Research USD 237K-329K

Acoustic Modeling | Audio signal processing | C++ | Deep learning | JAX

Senior-level Full Time

Irvine, CA, USA

8h ago
Data Scientist Computer Vision USD 109K-164K

AWS | AWS SageMaker | Active Learning | Airflow | CI/CD

Dental insurance | Health insurance | Paid time off | Retirement plan | Sick leave

Mid-level Full Time

Chesterfield, Missouri, US

11h ago
Senior Data Scientist USD 132K-187K

A/B Testing | Data Analysis | Data Pipelines

Commuter benefits | Disability benefits | Equity awards | Financial wellness support | Health insurance

Senior-level Full Time

San Jose, California

15h ago
Senior Data Scientist, SPB Global Optimization USD 175K-236K

Experiment design | Machine Learning | Python | R | SAS

Senior-level Full Time

New York, New York, USA

19h ago
Data Scientist Consultant USD 125K-267K

Clustering | Data Visualization | Data Wrangling | Language Processing | Machine Learning

401k plans | Flexible vacation policy | Hybrid work model | Medical and dental coverage | Paid time off for holidays

Mid-level Full Time

Hoboken, NJ, US, 07030

19h ago
