Research Scientist, Safety Post Training
San Francisco, CA; New York, NY
USD 216K-270K Senior-level Full Time
Tasks
- Apply interpretability techniques
- Collaborate with policymakers and engineers
- Create interpretability-informed evaluations
- Design post-training pipelines
- Develop post-training methods
- Guide targeted mitigations
- Identify unsafe behaviors
- Run post-training pipelines
- Translate research into evaluation benchmarks
- Translate research into safety standards
Perks/Benefits
- Commuter stipend
- Comprehensive health insurance
- Dental insurance
- Learning and development stipend
- Paid time off
- Retirement benefits
- Vision insurance
Skills/Tech-stack
Adversarial Evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human Feedback | Learning from Human Feedback | Machine Learning | Mechanistic Interpretability | Policy Optimization | Preference Optimization | Red Teaming | Reinforcement Learning | Reinforcement Learning from Human Feedback | Reward Hacking
Education
N/A