Research Scientist, LLM Evaluation & Post-Training

Remote Work( USA), United States R

USD 150K-300K Mid-level Full Time

@ C...

Apply Save

Found 1mo ago

Tasks

Analyze model behavior and failure patterns
Build benchmark datasets and test suites
Conduct human evaluation and rubric design
Create technical documentation and research reports
Define LLM evaluation research agenda
Design LLM post training experiments
Design scoring methods and evaluation metrics
Develop LLM evaluation frameworks
Develop scalable evaluation and post training pipelines
Engage with customer technical stakeholders on evaluation goals
Integrate human in the loop and synthetic evaluation strategies
Provide actionable model improvement recommendations
Publish and present research findings
Run robustness and stress testing

Perks/Benefits

Skills/Tech-stack

Apply Save

Language: en Views: 6

Clicks: 0

Saves: 0

Related jobs

Staff Applied Scientist USD 193K-242K

Algorithms | Data Analysis | Machine Learning | Machine Learning Inference | Metrics

401k match | Child care benefits | Family building benefits | Lyft Pink membership | Lyft credits

Senior-level Full Time

San Francisco, CA R

19h ago
Staff Applied Data Scientist, Pricing USD 207K-244K

A/B | A/B Testing | B testing | Causal Inference | Data Modeling

Quarterly in person surges | Remote-first work

Senior-level Full Time

Remote - USA R

2d ago
AI Scientist USD 100K-150K

Accelerator hardware | Deep learning | Distributed Training | JAX | Language Models

Remote work

Senior-level Full Time

United States - Remote R

2d ago
Applied AI Data Scientist USD 180K-240K

Agile | Causal Inference | Chain-of-Thought | Computer Vision | Deep learning

Annual team retreat | Flexible schedule | Free Lunches | Health insurance | Remote work

Mid-level Full Time

LA office / US remote R

2d ago
Senior Applied AI/ML Scientist - Retailer USD 211K-290K

C plus plus | Causal Inference | Deep learning | Experimentation | Java

Equity | Health benefits | Hybrid work | Paid time off | Remote work up to 4 weeks per year

Senior-level Full Time

San Francisco, CA R

2d ago
Sr. Data Scientist USD 133K-160K

Business Intelligence | Data Analysis | Language Models | Large Language Models | Looker

401k match | Equity | Flexible time off | Health insurance

Senior-level Full Time

Raleigh, NC R

2d ago
Data Scientist USD 90K-120K

AWS | Azure | Cypress | Databricks | Docker

Remote work

Mid-level Full Time

Remote, USA R

3d ago
Staff Data Scientist - Experimentation & Measurement USD 212K-318K

A/B | A/B Testing | B testing | Causal Inference | Experimental Methods

401k matching | Employee discounts | Medical/Dental/Vision | Paid time off | Wellness program

Senior-level Full Time

United States, Remote R

3d ago
Military Advisor- Data Science/Sentiment Analysis (Part Time/Remote) (Mission Assurance 4)- 29524 USD 106K-150K

APIs | AWS | Azure | BERT | BeautifulSoup

Episodic travel | Remote work | Training opportunities

Senior-level Part Time

Camp HM Smith, HI, Remote, United … R

3d ago
Senior Scientist, Data Science - Hybrid BRL 106K-132K

Anomaly Detection | Causal Inference | Decision Making | Deep learning | Demand forecasting

401k match | Disability insurance | Education assistance | Full health insurance on day one | Incentive plan

Senior-level Full Time

Boston, MA, US, 02110 R

3d ago
Natural Language Measurement Specialist USD 88K-145K

Dimensionality | Experimental Design | Fairness | Fine Tuning | Generalizability theory

Comprehensive benefits package | Occasional travel for business | Professional development | Remote work

Mid-level Full Time

Remote - New York, United States R

3d ago
Data Science Engineer USD 133K-236K

AWS | DAX | Databricks | Git | Machine Learning

Mid-level Full Time

San Jose, United States R

3d ago
Principal ML Scientist USD 215K-323K

Agent systems | Agentic AI | Claude Code | Cloud Platforms | Data Pipelines

Employee stock purchase plan | Paid time off | Parental leave | Remote work options | Tuition assistance

Senior-level Full Time

United States R

3d ago
Senior Data Scientist - Customer Experience USD 132K-166K

A/B | A/B Testing | AI | Airflow | Applied statistics

Senior-level Full Time

United States R

3d ago
Data Scientist II, ML Infrastructure USD 114K-235K

Apache Airflow | Causal Inference | Code review | Distributed Computing | Feature importance

Entry-level Full Time

Palo Alto, CA, US; Remote, US R

3d ago
Lead Data Scientist, Revenue Analytics USD 138K-150K

A/B | A/B Testing | B testing | Causal Impact | Data Pipelines

401k | Dental insurance | Medical insurance | Paid time off | Stock options

Senior-level Full Time

Remote, US R

4d ago
Senior Principal Data Scientist - Remote USD 134K-230K

Agentic Systems | Convolutional Neural Networks | Deep learning | Gradient boosting | Language Models

Remote work flexibility

Senior-level Full Time

San Diego, California, UNITED STATES; REMOTE R

4d ago
Senior Lead Applied AI Scientist USD 132K-220K

APIs | Agentic Architectures | Cloud AI | Deep learning | Experimentation

Senior-level Full Time

Remote, US R

4d ago
Sr. Data Scientist - AI Voice USD 113K-154K

Cloud platform | Data Pipelines | Deep learning | FHIR | Google Cloud

401k match | Dental insurance | Disability insurance | Employee assistance program | Employee discount program

Senior-level Full Time

Massachusetts, United States R

4d ago
Lead Data Scientist - Healthcare USD 150K-174K

AWS | Azure | Cloud platform | Deep learning | Demand forecasting

Career development opportunities | Equal employment opportunity | Individual responsibility

Senior-level Full Time

New Jersey, United States - Remote R

4d ago
Data Science - Analyst 4 USD 225K-254K

A/B | A/B Testing | B testing | Causal Impact | Causal Inference

401k eligibility | Medical benefits | Paid time off | Parental leave

Mid-level Full Time

San Jose, United States R

4d ago
AVP/VP, Underwriting Analytics USD 105K-192K

Anomaly Detection | Cloud Computing | Clustering | Data Mining | Data Pipelines

401k savings | Employee assistance program | Health and welfare benefits | Hybrid work | Inclusive culture

Executive-level Full Time

Denver - Lawrence, United States R

4d ago
Senior Data Scientist USD 160K-180K

Azure DevOps | Azure Machine Learning | BigQuery | CI/CD | Cross-validation

Senior-level Full Time

United States - Remote R

4d ago
Senior Data Scientist (BlackLocus division) USD 90K-180K

Algorithms | Big Data | Data Mining | Data Modeling | Data analytics

401k | ESPP | Health care benefits | Paid time off | Success sharing bonus

Senior-level Full Time

TEXAS - VIRTUAL - TX01, United … R

4d ago
Data Scientist, Core Infrastructure USD 144K-261K

Cloud infrastructure | Data Modeling | Distributed Computing | Hadoop | Log Analysis

401k plan | Company bonus | Equity | Medical, dental, and vision benefits | Wellness stipends

Mid-level Full Time

San Francisco R

4d ago

Research Scientist, LLM Evaluation & Post-Training

Tasks

Perks/Benefits

Skills/Tech-stack

Education

Roles

Regions

Countries

Related jobs