Research Scientist, Safety Post Training
San Francisco, CA; New York, NY
USD 216K-270K | Senior-level | Full Time
Tasks
- Apply interpretability techniques
- Collaborate with policymakers and engineers
- Create interpretability-informed evaluations
- Design post-training pipelines
- Develop post-training methods
- Guide targeted mitigations
- Identify unsafe behaviors
- Run post-training pipelines
- Translate research into evaluation benchmarks
- Translate research into safety standards
Perks/Benefits
- Commuter stipend
- Comprehensive health insurance
- Dental insurance
- Learning and development stipend
- Paid time off
- Retirement benefits
- Vision insurance
Skills/Tech-stack
Adversarial Evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human Feedback | Learning from Human Feedback | Machine Learning | Mechanistic Interpretability | Policy Optimization | Preference Optimization | Red Teaming | Reinforcement Learning | Reinforcement Learning from Human Feedback | Reward Hacking
Education
N/A