Researcher: Agent Post-Training, API & Power-Users
Tasks
- Build evals graders and environments
- Create feedback loops from power user traces
- Debug model failures using traces and evals
- Design and run model behavior experiments
- Develop model behavior hypotheses
- Develop training interventions with synthetic data and objectives
- Implement error recovery for long horizon tasks
- Improve tool use and function calling reliability
- Improve training launch reliability observability reproducibility cost latency
- Integrate post training improvements into major model runs
- Measure instruction following and factuality
- Own end to end model behavior projects
- Turn model failures into training data
Perks/Benefits
- N/A
Skills/Tech-stack
AI Feedback | Calibrated Reasoning | Data Generation | Deep learning | Error Recovery | Evals | Experiment design | Factuality | Function Calling | Human Feedback | Instruction following | LLM Evaluation | Language Models | Large Language Models | Learning from Human Feedback | Learning systems | Long Horizon Planning | Machine Learning | Machine learning systems | Model Observability | Production ML | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Reproducibility | Software Engineering | Statistical Analysis | Synthetic data | Systems Thinking | Tool use
Education
N/A
Regions
Countries
States
Related jobs
-
Software Engineer, Databases (Technical Leadership) USD 160K-293KAI | Automation | Consensus Protocols | Data Integrity | Database InternalsSenior-level Full TimeBellevue, WA | Menlo Park, CA1h ago
-
Account Security | Adversarial analysis | Anomaly Detection | Bias Mitigation | ClassificationSenior-level Full TimeMenlo Park, CA1h ago
-
AI Builder Intern USD 74K-111KAPI Integration | Anthropic API | Autogen | CrewAI | JavaScriptCommuter stipend | Comprehensive health dental and vision | Generous PTO | Learning and development stipend | Retirement benefitsEntry-level InternshipSan Francisco, CA; New York, NY7h ago
-
A/B | A/B Testing | B testing | Engagement modeling | Feature EngineeringSenior-level Full TimeSan Francisco8h ago
-
Audio Processing | Automatic gain control | Backend Infrastructure | C++ | Echo cancellationSenior-level Full TimeSan Francisco9h ago
-
Member of Technical Staff (AI Software Engineer, Agents) USD 220K-405KAI Evaluation | Browser technologies | CDP | Code Quality | Context engineeringSenior-level Full TimeSan Francisco9h ago
-
ADAS | Autonomous Vehicles | C++ | Camera | Data ProcessingCompany benefits program | Company bonus | Equity incentive plan | Hybrid work scheduleSenior-level Full TimeMountain View, CA, USA; San Francisco, …11h ago
-
Scientist, Bioinformatics USD 135K-186KATAC-seq | Bioinformatics | Cell-type annotation | Clustering | Differential expressionMid-level Full TimePalo Alto, CA12h ago
-
.NET | Angular | C# | C++ | CI/CD401k retirement plan | Company stock options | Dental insurance | Employee stock purchase plan | Life insuranceSenior-level Full TimeHawthorne, CA12h ago
-
Principal Applied Scientist USD 165K-331KApache Spark | Big Data | Data Science | Deep learning | Information RetrievalSenior-level Full TimeRedmond, WA, US12h ago
-
Data Scientist , AMXL Worldwide Science USD 136K-184KData Analysis | Data Ingestion | Data Pipelines | Feature Engineering | Large Scale DataMid-level Full TimeBellevue, Washington, USA R12h ago
-
Senior GenAI Software Engineer (North America) USD 165K-230KA/B | A/B Testing | B testing | Debugging | EvaluationEquity | Health, dental, and vision benefits | In person team gatherings quarterly | Remote-first work | Wellness stipendsSenior-level Full TimeUnited States R13h ago
-
Senior Software Engineer, AI Developer Experience USD 202K-230KAPI Integration | Agentic Workflows | Artificial Intelligence | Code review | Command LineCareer coaching and support | In-office culinary options | Inclusive family building benefits | Long term savings or retirement plans | Mental health wellness and fitness benefitsSenior-level Full TimeNew York City R13h ago
-
Lead - POC Data Science USD 200K-280KA/B | A/B Testing | Anomaly Detection | Apache Spark | B testing401k matching | Dental insurance | Flexible paid time off | Health and wellness stipend | Health insuranceSenior-level Full TimeUnited States R14h ago
-
Data Analytics Engineer USD 160K-195KATE | Advantest | Anomaly Detection | Cause analysis | CredenceSenior-level Full TimeSan Jose, California, United States14h ago
-
Machine Learning Scientist, BioML USD 200K-330KAWS | Azure | Bioinformatics | Cloud Computing | Computational Biology401k employer match | Equity participation | Health, dental, vision insurance | Paid time off | Professional developmentMid-level Full TimeEmeryville, California, United States; Hybrid (2-3 … R14h ago
-
Data & Digital Solutions Engineer USD 86K-118KAPI | AWS | Application development | Artificial Intelligence | AutomationMid-level Full TimeUS > Arizona > Phoenix14h ago
-
Research Robotics/Computer Vision Engineer USD 250K-300K3D SLAM | AWS | Articulated Object Reconstruction | Attention Models | Autonomous NavigationSenior-level Full TimeSan Mateo15h ago
-
Mid Level AI Engineer USD 98K-158KAWS | Anthropic | Autogen | Azure OpenAI | Backend Development401k match | Commuter benefits | Dental insurance | Employee Discount Marketplace | FSAMid-level Full TimeUS - Orlando15h ago
-
Machine Learning Platform Engineer USD 135K-160KAmazon SageMaker | Apache Flink | C++ | CI/CD | Cloud PubSub401k match | Annual bonus | Company equipment provided | Company medical dental vision plans | Disability benefitsMid-level Full TimeAtlanta, GA preferred, Remote R15h ago
-
Senior Research Engineer, Olmo + Molmo USD 146K-220KAgentic Systems | Amazon Web Services | Cloud Computing | Cloud platform | ContainerizationFamily leave | Paid sick leave | Paid vacationSenior-level Full TimeSeattle, WA15h ago
-
Machine Learning Engineer, Customer Support Engineering USD 162K-186KAgent Orchestration | Agent systems | Artificial Intelligence | Autonomous Reasoning | Fine TuningSenior-level Full TimeRemote-USA R15h ago
-
Artifact management | Blue-Green Deployment | Blue/green | CI/CD | Cost-aware scheduling401k company match | Dental insurance | Flexible work schedule | Life insurance | Medical insuranceSenior-level Full TimeLos Angeles, USA15h ago
-
Data Scientist, Product USD 155K-185KA/B | A/B Testing | B testing | Data Analysis | Data VisualizationMid-level Full TimeSeattle, Washington, United States15h ago
-
Data Scientist, Product USD 155K-185KData Analysis | Experimentation | Machine Learning | Python | RMid-level Full TimeMountain View, CA15h ago