Researcher: Agent Post-Training, API & Power-Users
Tasks
- Build evals graders and environments
- Create feedback loops from power user traces
- Debug model failures using traces and evals
- Design and run model behavior experiments
- Develop model behavior hypotheses
- Develop training interventions with synthetic data and objectives
- Implement error recovery for long horizon tasks
- Improve tool use and function calling reliability
- Improve training launch reliability observability reproducibility cost latency
- Integrate post training improvements into major model runs
- Measure instruction following and factuality
- Own end to end model behavior projects
- Turn model failures into training data
Perks/Benefits
- N/A
Skills/Tech-stack
AI Feedback | Calibrated Reasoning | Data Generation | Deep learning | Error Recovery | Evals | Experiment design | Factuality | Function Calling | Human Feedback | Instruction following | LLM Evaluation | Language Models | Large Language Models | Learning from Human Feedback | Learning systems | Long Horizon Planning | Machine Learning | Machine learning systems | Model Observability | Production ML | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Reproducibility | Software Engineering | Statistical Analysis | Synthetic data | Systems Thinking | Tool use
Education
N/A
Regions
Countries
States
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R15h ago
-
Staff AI Engineer - Conversational & Agentic AI USD 176K-308KA/B | A/B Testing | API Design | Agent systems | Artificial Intelligence401k plan | Company match | ESPP | Family leave programs | Flexible spending accountsSenior-level Full TimeSanta Clara, California, United States9h ago
-
Sr AI Engineer USD 84K-105K3D Printing | Audio signal processing | C# | Data Preprocessing | Digital SignalAccrued Paid Vacation | Commuter benefits | Dental insurance | Flexible spending account | Health savings accountSenior-level Full TimeColumbia, MARYLAND, United States12h ago
-
Senior-level Full TimePlymouth, MI, United States13h ago
-
Senior Staff AI Engineer USD 139K-185K3D Printing | AI Algorithms | Audio signal processing | C# | Data PreprocessingAccrued Paid Vacation | Commuter benefits | Dental insurance | Employee resource groups | Flexible spending accountSenior-level Full TimeColumbia, MARYLAND, United States14h ago
-
Founding Engineer USD 110K-160KAPIs | Automated Evaluation | Fine Tuning | Infrastructure | Language ProcessingMid-level Full TimeSan Francisco, CA, US14h ago
-
AWS | Agentic AI | Azure | CI/CD | Cloud platform401k | Medical | Paid sick leaveMid-level ContractSouth San Francisco, United States14h ago
-
Senior-level Full TimeMiami, New York, San Francisco14h ago
-
Research Scientist - LLM Training System as a Service - Global Frontier Tech Recruitment Program - 2027 Start (PhD) USD 202K-368KCUDA | Distributed Training | GPU Performance | GPU Performance Optimization | Language ModelsEntry-level Full TimeSan Jose, California, United States15h ago
-
Senior Databricks Forward Deployed Engineer - GPS USD 155K-306KAirflow | CI/CD | DBT | Data Modeling | DatabricksMentorship | Professional development | Travel for client workSenior-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …15h ago
-
Lead Databricks Forward Deployed Engineer - GPS USD 189K-372KAPI | AWS | Agent Bricks | Airflow | Apache SparkSenior-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …15h ago
-
Entry-level Full TimeArlington/Rosslyn, Virginia, United States16h ago
-
Databricks Senior Consultant USD 124K-207KAWS | Azure | Business Intelligence | Cloud Computing | Cloud platformSenior-level Full TimeArlington/Rosslyn, Virginia, United States; Sacramento, California, …16h ago
-
Delivery Senior Consultant, Data Engineering and Gen AI USD 155K-265K.NET | AWS | Agile | Angular | AzureMentorship opportunities | Professional development | Travel reimbursementSenior-level Full TimeGilbert, Arizona, United States; Lake Mary, …16h ago
-
AI and Data Science Engineer (TS/SCI Poly) USD 93K-170KAPIs | Artificial Intelligence | CI/CD | Cloud Platforms | ContainerizationMid-level Full TimeMcLean, Virginia, United States16h ago
-
Azure Databricks Developer USD 126K-198KApache Spark | Azure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake StorageSenior-level Full TimeLouisville, Kentucky, United States16h ago
-
Sr. Data Scientist USD 131K-198KMachine Learning | Python | R | SQL | Statistical modelingHybrid work schedule | Limited travelSenior-level Full TimeRaleigh, North Carolina, United States16h ago
-
Senior Data Scientist - Government & Public Services USD 131K-218KClass imbalance | Cloud Computing | Data Exploration | Data Preparation | Data leakageSenior-level Full TimeArlington/Rosslyn, Virginia, United States16h ago
-
Delivery Senior Consultant, Data Engineering and Gen AI USD 155K-265K.NET | AWS | Agile | Angular | AzureMentorship | Professional development | Travel opportunitiesSenior-level Full TimeGilbert, Arizona, United States; Lake Mary, …16h ago
-
Generative AI Engineer III - Federal Health USD 110K-218KArtificial Intelligence | Data Engineering | Data Pipelines | Data Validation | DockerMentorship opportunities | Professional developmentSenior-level Full TimeArlington/Rosslyn, Virginia, United States16h ago
-
Data Engineer III (Secret Clearance Required) USD 107K-179KAWS | Anomaly Detection | Artificial Intelligence | Classification | ClusteringProfessional developmentSenior-level Full TimeArlington/Rosslyn, Virginia, United States16h ago
-
Lead AI and Data Solutions Engineer II USD 134K-224KAmazon Web Services | Apache Spark | Application Programming | Application Programming Interfaces | Cloud platformMentorship | Professional developmentSenior-level Full TimeSacramento, California, United States; Tempe, Arizona, …16h ago
-
Machine Learning Engineer, Ads Creative USD 194K-355KData Analysis | Deep learning | Machine Learning | Model Training | TargetingSenior-level Full TimeSan Jose, California, United States16h ago
-
Research Engineer - MSL FAIR Foundations USD 141K-208KAudio Processing | Benchmarking | Code review | Data Pipelines | Deep learningMid-level Full TimeMenlo Park, CA | Seattle, WA …17h ago
-
Performance & Capacity Engineer - Planning Optimization USD 147K-208KAI Models | Agent Orchestration | Artificial Intelligence | Bias Mitigation | Bin packingSenior-level Full TimeBellevue, WA | Menlo Park, CA …17h ago