Agent Post-Training, API & Power Users
Tasks
- Build evaluation systems graders and environments from real workflows
- Convert observed model failures into training data
- Create feedback loops using power user traces and API usage patterns
- Debug hard model failures using traces evals training data and product context
- Design and run model behavior experiments for API workflows
- Develop behavior improvement projects from failure analysis to integration and launch readiness
- Develop training and alignment interventions using objectives and synthetic data
- Improve large scale training and launch reliability observability and cost latency
- Improve tool use planning and long horizon execution reliability
- Lead cross functional projects for multi agent systems and production like environments
Perks/Benefits
- N/A
Skills/Tech-stack
AI Feedback | Agent systems | Computer use | Cost Optimization | Data Generation | Deep learning | Evaluation | Experimentation | Function Calling | Human Feedback | Language Models | Language Processing | Large Language Models | Latency optimization | Learning from Human Feedback | Long Horizon Execution | Machine Learning | Model Debugging | Model Training | Multi-Agent | Multi-Agent Systems | Natural Language | Natural Language Processing | Observability | Planning | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Reproducibility | Statistical modeling | Synthetic data | Tool use
Education
N/A
Regions
Countries
States
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R1d ago
-
Senior-level Full TimeHouston, TX, US5h ago
-
AI Research Lead USD 96K-104KAPIs | AWS | Cloud infrastructure | Code review | Copyright ProtectionCompany paid life insurance | Medical, dental & vision coverage | Mental well-being resources | Paid parental leave | Paid sick daysSenior-level Full TimeNew York, NY, US, 102817h ago
-
Senior-level Full TimeNew York, NY11h ago
-
Senior Applied AI Scientist USD 182K-220KCausal Inference | Data Pipelines | Experiment design | LLM | Machine LearningSenior-level Full TimeNew York, NY11h ago
-
AI Engineer USD 149K-184KAPI Integration | EHR Integration | Evaluation | Language Models | Large Language ModelsSenior-level Full TimeNew York, NY12h ago
-
Senior AI Engineer USD 182K-220KAgent workflows | Arize Phoenix | Compliance | Cost Optimization | EmbeddingsSenior-level Full TimeNew York, NY12h ago
-
AI Engineer USD 78K-82KAPI Integration | CI/CD | Containerization | Csharp | Function Calling401k match | Flexible spending account | Health savings account | Legal ID Shield | Life insuranceEntry-level Full TimeTroy, MI, US12h ago
-
AI Scientist, Computational Protein Design USD 120K-240KArtificial Intelligence | Deep learning | Distributed Training | GPU Computing | Generative AIMid-level Full TimeSouth San Francisco, California, United States13h ago
-
Staff Research Engineer, Data Agents USD 190K-270KAgent systems | Agentic reinforcement learning | LLM Agents | Language Models | Language ProcessingSenior-level Full TimeSan Francisco, California13h ago
-
Machine Learning Engineer USD 101K-224KAgentic Workflows | Embeddings | Fine Tuning | Hugging Face | LLM orchestrationSenior-level Full TimeBellevue, WA, US, 9800414h ago
-
Machine Learning Operations Engineer (MLOps) USD 101K-224KAlerting | Azure | CI/CD | Docker | Inference OptimizationSenior-level Full TimeBellevue, WA, US, 9800414h ago
-
Senior AI Engineer USD 145K-181KAWS | Alerting | Azure | Docker | Embeddings401k match | Commuter benefits | Dental | Healthcare | Remote friendly workplaceSenior-level Full Time3750 Market Street, Philadelphia, PA, United … R14h ago
-
Senior-level Full TimeFort Meade, MD14h ago
-
Senior Machine Learning Engineer USD 180K-250KComputer Vision | Data Pipelines | Data labeling | Deep learning | Embedding Models100 percent remote | 13 paid holidays | 401k plan | Dental insurance | Medical insuranceSenior-level Full TimeRemote USA R14h ago
-
Senior AI Engineer USD 250K-300KAPI Development | Artificial Intelligence | Cost Optimization | GitHub | Inference Optimization401k match | Co working sessions | Flexible PTO | Health and wellness allowance | Health insuranceSenior-level Full TimeSan Francisco (Hybrid) R15h ago
-
Sr. Data Scientist, Performance Marketing USD 139K-287KCausal Inference | Dashboarding | ETL | Experimentation | ForecastingSenior-level Full TimeSan Francisco, CA, US; Remote, US R15h ago
-
Staff Machine Learning Engineer USD 278K-330KAutomatic Speech Recognition | Cloud Computing | Data Augmentation | Data Pipelines | Data PreprocessingSenior-level Full TimeMountain View, CA15h ago
-
Senior Machine Learning Engineer USD 230K-265KAutomatic Speech Recognition | Cloud Computing | Data Augmentation | Data Preprocessing | Decoding strategiesSenior-level Full TimeMountain View, CA15h ago
-
Machine Learning Engineer III, Core Agents USD 175K-219KAgent Orchestration | Embeddings | Evaluation | Indexing | Information RetrievalSenior-level Full TimeRedwood City, CA15h ago
-
Senior AI Engineer USD 190K-316KAI orchestration | Agent Frameworks | Context Management | Evaluation | LLM401k match | Commuter parking stipend | Dental insurance | Employee stock purchase program ESPP | Flexible time offSenior-level Full TimeSan Francisco, CA15h ago
-
AI Engineer USD 180K-240KAPIs | Agent systems | Cloud infrastructure | Evaluation | Graph traversalSenior-level Full TimeMountain View, CA16h ago
-
Sr AI Engineer - Agentic Systems USD 166K-205KAI Safety | API Integration | Agent Orchestration | Artificial Intelligence | Distributed SystemsSenior-level Full TimeAnywhere, US R16h ago
-
Research Engineer – Machine Learning & Robotics USD 159K-211KC++ | Computer Vision | Data Pipelines | Data Quality | Data quality monitoringSenior-level Full TimeLenexa, Kansas16h ago
-
Applied AI Specialist, Commercial Customer Success USD 105K-142KAPI Integration | Accuracy Monitoring | Automated testing | CRM | Evaluation FrameworksRemote workSenior-level Full TimeRemote - US R16h ago