Researcher, Alignment Oversight
Tasks
- Analyze deployment data
- Build research prototypes and production systems
- Collaborate cross-functional teams
- Conduct red teaming
- Deploy action monitoring
- Design alignment experiments
- Develop alignment evaluations
- Ensure reliability and independence of oversight process
- Evaluate model failure modes
- Feed oversight signals into training
- Implement human in the loop control
- Implement oversight systems
- Publish research results
Perks/Benefits
Skills/Tech-stack
Evaluation Design | Experimentation | Human-in-the-loop | Language Models | Large Language Models | Machine Learning | Model Evaluation | Model Training | Preference optimization | Red Teaming | Reinforcement Learning | Scalable oversight | The Loop
Education
Regions
Countries
States
Related jobs
-
Lead Quantitative UX Researcher USD 150K-185KA/B | A/B Testing | B testing | Bayesian statistics | Behavioral Modeling401k match | Annual bonus | Annual performance reviews | Career development support | Company equipmentSenior-level Full TimeAtlanta, GA preferred, Remote R17h ago
-
Postdoctoral Researcher, AI and Systems Co-design Team USD 112K-145KAlgorithms | Compilers | Computer Architecture | Distributed Systems | Machine LearningEntry-level Full TimeMenlo Park, CA22h ago
-
Quantitative Trader USD 150K-216KAlgorithmic trading | Deep learning | Linear Models | Machine Learning | Quantitative AnalysisCollaborative work environment | Firmwide educational curriculum | Hands-on trainingMid-level Full TimeNew York, New York, United States22h ago
-
Quantitative Researcher USD 150K-200KDistributed Training | Feature Engineering | Hyperparameter Tuning | Machine Learning | PythonSenior-level Full TimeNew York, New York, United States22h ago
-
Sr. Applied Scientist, Amazon Robotics USD 167K-226KAI Reasoning | Algorithm Development | Classical AI | Classical AI Reasoning | Language ModelsSenior-level Full TimeBoston, Massachusetts, USA1d ago
-
Principal AI/ML Researcher / Engineer Reasoning, Planning, and Decision-making systems USD 296K-370KAgent systems | Belief State Tracking | C++ | Decision Making | Distributed Reinforcement LearningSenior-level Full TimeUnited States R1d ago
-
Computational Chemistry | Field development | Force Field | Force field development | Machine LearningDental insurance | Employee assistance program | Flexible spending accounts | Health insurance | Life insuranceMid-level Full TimeGILMAN - Gilman Hall, United States2d ago
-
Research Scientist, Frontier Health, DeepMind USD 174K-252KClinical Reasoning | Evaluation | Experimentation | GRPO | Human evaluationMid-level Full TimeMountain View, CA, USA3d ago
-
Research Scientist in Generative AI Graduate (Intelligent Creation) - 2026 Start (PhD) USD 136K-250K3D Generation | Artificial Intelligence | Computer Vision | Deep learning | Generative AIEntry-level Full TimeSan Jose, California, United States4d ago
-
Student Researcher (LLM Post Training – Agent & Reinforcement Learning) - 2026 Start (PhD) USD 202K-368KCoding | Data Construction | Fine Tuning | Instruction Tuning | Language ModelsInternshipEntry-level Full TimeSan Jose, California, United States5d ago
-
Bash | ESPnet | Linux | Machine Learning | PyTorchSenior-level Full TimeUS-MD-COLUMBIA-720 ~ 9861 Broken Land Pkwy …6d ago
-
Bioinformatics | Deep learning | Dynamic Gene Regulatory Networks | Gene Regulatory Networks | MATLABNone Full TimeLocation S, United States6d ago
-
Applied Researcher, Perception USD 139K-201KComputer Vision | High Performance | High-Performance Computing | Language Models | Language ProcessingHybrid work environmentNone Full TimeMountain View, California; Pittsburgh, Pennsylvania; San …6d ago
-
Amazon Redshift | C++ | Econometrics | Machine Learning | NumPyBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersEntry-level Full TimeNew York, NY, United States6d ago
-
Quantitative Researcher, Quantitative Strategies USD 150K-200KBacktesting | Data Mining | LLMs | Language Processing | Machine LearningComprehensive benefitsSenior-level Full TimeNew York, New York, United States …7d ago
-
Cybersecurity Expert - RL USD 130K-200KAWS | Bash | Cloud platform | CrowdStrike | Cyber ThreatHigh autonomy | Hybrid work | In person Bangalore officeSenior-level Full TimeRemote R7d ago
-
Staff AI Researcher USD 148K-210KData Preprocessing | Deep learning | Distributed Systems | Feature Engineering | Fine Tuning401k match | Dental insurance | Educational reimbursement | Flexible work schedule | Health insuranceSenior-level Full TimeRemote, United States R7d ago
-
Artificial Intelligence | Automation | Data Modeling | Data Processing | Data analyticsBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersExecutive-level Full TimeNew York, NY, United States7d ago
-
Sr. Responsible AI Researcher, AI.x USD 160K-259KAI ethics | Adversarial Robustness | Alignment | Artificial Intelligence | Bias detectionSenior-level Full TimeSan Francisco, CA, United States8d ago
-
AI Researcher, AI.x USD 150K-275KAI reliability | Agentic Systems | Deep learning | Experimentation | GenAIHybrid work | On-site collaborationSenior-level Full TimeSan Francisco, CA, United States8d ago
-
Algorithmic trading | Automated Execution | Data Analysis | Econometrics | Execution Strategy OptimizationAnnual discretionary bonus | Flexible time off | Healthcare benefits | Hybrid work model | Retirement benefitsSenior-level Full TimeNY7 - 50 Hudson Yards, New … R8d ago
-
AI Governance | AI Policy | AI compliance | Adjudication | Artificial IntelligenceSenior-level Full TimeSan Francisco, California, United States8d ago
-
AI Governance | Adjudication | Artificial Intelligence | Calibration | Data labelingSenior-level Full TimeSan Francisco, California, United States8d ago
-
Machine Learning Researcher USD 250K-300KData Preprocessing | Deep learning | Feature Engineering | JAX | Machine LearningDiscretionary bonus | Insurance | Paid leaveMid-level Full TimeChicago, United States; New York, United …9d ago
-
Researcher, Context - Agent Post-Training USD 250K-380KData Pipelines | Deep learning | Experimentation | Grading systems | Language ModelsMid-level Full TimeSan Francisco11d ago