AI Evaluation Scientist
Tasks
- Analyze model behavior
- Assess AI model outputs
- Build evaluation scripts
- Collaborate with data scientists
- Contribute to evaluation framework development
- Design evaluation processes
- Develop benchmark datasets
- Develop test harnesses
- Document evaluation results
- Perform error analysis
- Support responsible deployment
Perks/Benefits
Skills/Tech-stack
AI Evaluation | AI evaluation frameworks | Behavior Analysis | Data Analysis | Evaluation Frameworks | Evaluation metrics | Hugging Face | Langchain | Language Processing | Model behavior | Model behavior analysis | Natural Language | Natural Language Processing | PyTorch | Python | Scikit-learn | Statistical Testing | Test Design
Education
Roles
Related jobs
-
AI Research Scientist - FAIR Social Intelligence USD 177K-251KArtificial Intelligence | Behavioral Science | Game theory | Machine Learning | PythonMid-level Full TimeBellevue, WA1h ago
-
AI Research Scientist - FAIR Social Intelligence USD 177K-251KArtificial Intelligence | Game theory | Machine Learning | Python | Reinforcement LearningEntry-level Full TimeBellevue, WA | Seattle, WA1h ago
-
AI Research Scientist, FAIR Chemistry USD 117K-173KApplied Mathematics | Artificial Intelligence | Computational statistics | Generative Models | Machine LearningEntry-level Full TimeSan Francisco, CA1h ago
-
AI Research Scientist, FAIR Chemistry USD 147K-208KApplied Mathematics | Artificial Intelligence | Computational statistics | Generative Models | Machine LearningSenior-level Full TimeMenlo Park, CA | San Francisco, …1h ago
-
AI research | Computer Vision | Data Curation | Language Processing | Model TrainingSenior-level Full TimeMenlo Park, CA1h ago
-
AI Engineer USD 118K-177KData Analysis | Distributed Computing | Machine Learning | Optimization | PythonBenefits package | Flexible work arrangementsEntry-level Full TimeJuno Beach, FL, US, 334085h ago
-
Senior AI/ML Engineer USD 125K-157KAI platforms | AWS | Agentic AI | Agentic AI Platforms | Data GovernanceDental insurance | Disability plans | Employee referral program | Fertility benefits | Life insuranceSenior-level Full TimeUS Remote R6h ago
-
AI Program Manager USD 125K-165KAI Technologies | Agile methodology | Automation | Business Analysis | Change Management401k benefits | Career development opportunities | Flexible work | Performance incentives | Spot AwardsMid-level Full TimeBoston, MA, United States8h ago
-
AI Engineer USD 140K-180KAI Automation | APIs | CI/CD | Data integration | JavaScriptCollaborative environment | Flexible working hours | Growth opportunities | Innovative projects | Remote work optionsMid-level Full TimeNew York, New York8h ago
-
Applied Scientist, Demand Forecasting USD 142K-193KDeep learning | Distributed Systems | Generative AI | Graph Neural Networks | Neural NetworksMid-level Full TimeBellevue, Washington, USA12h ago
-
ML Architect USD 152K-202KAI Governance | APIs | Cloud Native | Distributed Systems | EmbeddingsCollaborate with Fortune 500 | Impact at AI-first company | Upskill opportunities | Work with innovative technologySenior-level Full TimeUSA - Remote, United States R12h ago
-
AI Safety | Cloud Platforms | Containerization | Deep learning | Distributed Systems401k | Dental insurance | Medical insurance | Paid sick hours | Vision insuranceSenior-level Contract Full TimeRidgefield Park, NJ, United States12h ago
-
AI/ML Intern USD 66K-100KData Analysis | Deep learning | Java | Machine Learning | PyTorchEducation reimbursement | Health plans | Parental leave | Retirement options | Time offEntry-level InternshipUSA - Update Location14h ago
-
AI Product Engineer USD 149K-214KAI APIs | API Integration | Agents | Data Pipelines | LLMsComprehensive benefits | Flexible work hoursMid-level Full TimeRemote - USA R16h ago
-
Social Scientist Statistician (PhD) USD 89K-166KBehavioral Science | Data Analysis | Data Visualization | Multivariate analysis | Research DesignFlexible work arrangements | Healthcare benefits | Professional development opportunitiesMid-level Full TimeTampa, FL16h ago
-
Senior Data Scientist USD 150K-200KA/B | A/B Testing | B testing | Causal Inference | Data AnalysisSenior-level Full TimeFort Lauderdale Office17h ago
-
Scientist II, Protein Analytics USD 121K-230KCapillary electrophoresis | Data Analysis | Experimental Design | Method validation | Protein HPLCDental insurance | Medical insurance | Paid time off | Retirement plan | Short-term incentivesSenior-level Full TimeNorth Chicago, IL, United States17h ago
-
Artificial Intelligence | C++ | Data Analysis | High Performance | High-Performance ComputingCollaborative culture | Medical/Dental/Vision | Paid time off | Retirement plan | Training programsSenior-level Full TimeColorado Springs, CO17h ago
-
Lead Machine Learning / Data Science Engineer USD 90K-200KAWS | Agent Orchestration | Agentic AI | Azure | Data Warehousing401k matching | Employee resource groups | Fertility and family benefits | Flexible work environment | Health and well-being programsSenior-level Full TimeChicago, IL, United States18h ago
-
Computational Chemist USD 52K-62KAPI | C# | C++ | Data Processing | Data analytics401k | Dental insurance | Disability insurance | Life insurance | Medical insuranceEntry-level Full TimeGroton, CT, United States18h ago
-
Pharmaceutical Computational Scientist USD 52K-60KAI | APIs | Automation | Bayesian optimization | C#401k | Dental insurance | Disability insurance | Medical insurance | Paid HolidaysEntry-level Full TimeGroton, CT, United States18h ago
-
Computational Scientist USD 50K-62KAPI | Automation engineering | C# | C++ | Data Science401k | Dental coverage | Disability insurance | Life insurance | Medical coverageEntry-level Full TimeGroton, CT, United States18h ago
-
AI Intern (Architecture & AI) USD 50K-50KAI Technologies | API | Data Analysis | Excel | Generative AIFlexible work arrangements | Healthcare benefits | Mental health resources | Paid time off | Parental leaveEntry-level InternshipEnglewood Cliffs, NJ18h ago
-
Lead Machine Learning / Data Science Engineer USD 90K-200KAWS | Agent Orchestration | Agentic AI | Amazon RDS | AzureCommunity partnerships | Employee resource groups | Family and fertility benefits | Flexible work environment | Learning and development programsSenior-level Full TimeDenver, CO, United States18h ago
-
Analytics Practitioner USD 86K-176KClient Communication | Data Analysis | Data Visualization | Data strategy | Palantir FoundryCertification opportunities | Inclusive work environmentSenior-level Full TimeWashington, DC19h ago