Research Scientist, LLM Evaluation & Post-Training

Remote Work (USA), United States

USD 150K-160K | Senior-level | Full Time

@ C...

Found 1mo ago

Tasks

Analyze model behavior and failure patterns
Build evaluation and post-training pipelines with ML teams
Create benchmark datasets and evaluation reports
Define and execute LLM evaluation research agenda
Design experiments for post-training outcomes
Develop evaluation frameworks and benchmarks
Implement scoring reliability and measurement validity
Improve evaluation redesign recommendations
Partner with customers to review evaluation methodologies
Publish research findings and contribute to open-source
Run human and automated evaluation studies

