Technical AI Policy Researcher, Model Behaviour - Trust and Safety
San Francisco, California, United States
USD 182K-258K (estimate) Senior-level Full Time
Tasks
- Build policy artifacts for model training evaluation and deployment
- Construct golden sets and labeling guidance
- Contribute to safety reports policy documentation and governance reviews
- Define boundaries for safe and unsafe AI use
- Design and maintain multimodal GenAI policies across safety domains
- Develop end to end policy workflows for pre launch evaluation and post launch monitoring
- Identify emerging safety fairness and bias challenges
- Monitor post launch model activity for unsafe behavior gaps
- Operationalize policy into scalable model behavior and measurable safeguards
- Perform calibration adjudication and evaluation coverage analysis
- Support regulatory teams as AI compliance subject matter expert
- Translate risk and harm models into behavioral specifications
- Use red teaming and deployment data to improve policies and evaluations
Perks/Benefits
- N/A
Skills/Tech-stack
AI Governance | AI Policy | AI compliance | Adjudication | Artificial Intelligence | Behavioral Specifications | Bias | Calibration | Coverage analysis | Evaluation | Evaluation Coverage Analysis | Fairness | Generative AI | Golden Set | Golden Set Construction | Labeling Guidance | Model Monitoring | Model risk | Model risk assessment | Multimodal AI | Red Teaming | Risk Assessment | Safety | System Safeguards
Education
N/A
Regions
Countries
States
Related jobs
-
AI Governance | Adjudication | Artificial Intelligence | Calibration | Data labelingSenior-level Full TimeSan Francisco, California, United States6h ago
-
Staff AI/ML Engineer USD 240K-270KAWS | Agentic Workflows | Azure | Data Curation | Deep learning401k matching | Commuter benefits | Comprehensive health benefits | Dog-friendly office | EquitySenior-level Full TimeNew York City, NY15h ago
-
Staff AI/ML Engineer USD 240K-270KAWS | Agentic Workflows | Azure | Data Curation | Data Pipelines401k | Commuter benefits | Dog-friendly office | Equity | FSA benefitsSenior-level Full TimeSan Francisco, CA15h ago
-
Sr. Manager, AI Lead - Semantic Layer - Remote USD 168K-224KAPI Integration | Analytics | Artificial Intelligence | Data Governance | Data ModelingRemote workSenior-level Full TimeCalifornia - Home Teleworkers, United States R18h ago
-
Principal AI/ML Researcher / Engineer Reasoning, Planning, and Decision-making systems USD 296K-370KAgent systems | Artificial Intelligence | Belief State Tracking | Caching | Causal modelingSenior-level Full TimeUnited States R18h ago
-
AI-assisted coding | API Integration | Assisted coding | Business Process | Business process automationLocal preferred | Remote work preferredMid-level ContractLos Angeles, United States2d ago
-
Responsible AI Program Manager, Google Public Sector USD 165K-239KAI Governance | AI Safety | Compliance | Executive Communication | GovernanceSenior-level Full TimeReston, VA, USA; Washington D.C., DC, …2d ago
-
AI Engagement Manager USD 145K-270KBigQuery | Business Intelligence | Generative AI | Machine Learning | SQLEqual opportunity employment | Pay equity commitment | Remote work optionMid-level Full TimeNew York2d ago
-
Algorithms | Angular | Bash | CSS | Continuous DeliveryCareer development | Hybrid work | Mentoring | Paid internshipEntry-level InternshipPalo Alto, CA, US, 943042d ago
-
AI/ML Platform Engineer USD 152K-205KArtificial Intelligence | Inference Parameter Tuning | Language Models | Large Language Models | OpenAI Compatible401k company match | Career development support | Comprehensive benefits and wellness packages | Hybrid work | Internal mobility supportSenior-level Full TimeUSA VA Sterling - 22626 Sally …2d ago
-
AI Engineer I - Hybrid USD 125K-135KAI Services | API Development | Agentic Workflows | Azure | Azure AIHealth insurance | Hybrid work | Paid time off | Remote work options | Retirement planSenior-level Full TimeWindsor, Colorado, United States R2d ago
-
AI Principal Technical Consultant, AI Services USD 155K-180KAPI | AWS | Agentic Workflows | Artificial Intelligence | Automated testingSenior-level Full TimeUnited States2d ago
-
AI Product Manager - GenAI and Agentic Capabilities USD 90K-150KAgentic AI | Agile | Experimentation | Generative AI | LLM EvaluationCareer development opportunities | Holistic well-being support | Visa sponsorshipMid-level Full TimeNew York, NY, United States2d ago
-
Lead, AI Engineering USD 180K-220KAWS Glue | AWS RDS | AWS SageMaker | Amazon Kinesis | Amazon Kinesis Firehose401k match | Medical/Dental/Vision | Paid holiday | Paid parental leave | Paid time offSenior-level Full TimeCharlotte, North Carolina, United States3d ago
-
Principal AI Engineer, Data USD 171K-269KArtificial Intelligence | Backend Development | CI/CD | Cloud Computing | Data DriftEmployee family well being programs | Health and wellness programs | Holistic lifestyle programs | Hybrid workSenior-level Full TimeWaltham, Massachusetts, United States3d ago
-
Senior Staff AI Engineer - Enterprise AI (Agentic AI) USD 191K-315KArtificial Intelligence | Language Models | Language Processing | Large Language Models | Machine LearningHealth and wellness programs | Time awaySenior-level Full TimeMountain View, CA, United States3d ago
-
AI Transformation Specialist (Remote, Contract) USD 82K-130KAgent workflows | Artificial Intelligence | Automation | Code Automation | Language ProcessingContract position | Remote workEntry-level ContractGeorgia R3d ago
-
AI Delivery Director USD 175K-225KAgile | Artificial Intelligence | Delivery management | Demand forecasting | Generative AIExecutive-level Full TimeUS-New Jersey3d ago
-
Anthropic Forward Deployed Engineer - GPS USD 97K-171KAPI Integration | AWS | Airflow | Anthropic Claude | Anthropic Claude APIMentorship | Professional development opportunities | Reasonable accommodations | Travel 50% | US government security clearance supportMid-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …3d ago
-
APIs | Agent systems | Cloud platform | Cost Per Request | CrewAIEmployee discounts | Health insurance | Paid time off | Retirement plansMid-level Full TimeSan Francisco, CA, USA; Atlanta, GA, …3d ago
-
Sr. Director, AI Program Management (Remote) USD 210K-300KAI Governance | AI RMF | AI Safety | GDPR | Generative AIAdoption leave | Competitive vacation and holidays | Employee networks | Great Place to Work certification | Paid parental leaveSenior-level Full TimeUSA CA Remote, United States R3d ago
-
Manager - Applied AI Delivery USD 170K-175KAWS | Agile | Agile Framework | Artificial Intelligence | Bias Mitigation401k match | AAA membership | Adoption Assistance | Company holidays | Discounts and rewardsMid-level Full TimeMI-Admin Office Building (AOB), United States R3d ago
-
AI Security Engineer USD 100K-141KAI Foundry | AWS Bedrock | Adversarial Emulation | Adversarial Machine Learning | Agentic AIPaid time offMid-level Full TimeChicago, United States3d ago
-
AI & Data Solution Architect USD 96K-192KAWS Glue | AWS Lambda | Amazon EMR | Amazon Redshift | Amazon S3Dental insurance | Health care benefits | Health savings account | Life insurance | Long-term disabilitySenior-level Full TimeCAG24: Atlanta Digital Hub, 3350 Riverwood …3d ago
-
VP II, Advisor AI Solutions - Product Management USD 169K-281KAPI Integration | Access Revocation | Adoption and change management | Agentic AI | Artificial IntelligenceExecutive-level Full TimeFort Mill/Charlotte, United States3d ago