Research Scientist, Safety Post Training
San Francisco, CA; New York, NY
USD 216K-270K | Senior-level | Full Time
Tasks
- Apply interpretability techniques
- Collaborate with policymakers and engineers
- Create interpretability-informed evaluations
- Design post-training pipelines
- Develop post-training methods
- Guide targeted mitigations
- Identify unsafe behaviors
- Run post-training pipelines
- Translate research into evaluation benchmarks
- Translate research into safety standards
Perks/Benefits
- Commuter stipend
- Comprehensive health insurance
- Dental insurance
- Learning and development stipend
- Paid time off
- Retirement benefits
- Vision insurance
Skills/Tech-stack
Adversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human Feedback | Learning from Human Feedback | Machine Learning | Mechanistic Interpretability | Policy Optimization | Preference optimization | Red Teaming | Reinforcement Learning | Reinforcement Learning from Human Feedback | Reward Hacking
Education
N/A