Find jobs in AI/ML, Data Science and Big Data
2 results for Inter-rater reliability (Skill/Tech stack)
-
Research Scientist, LLM Evaluation & Post-Training
USD 150K-300K
AI Feedback | Alignment | Benchmarking | Context evaluation | Deep learning
Mid-level
Full Time
Remote Work (USA), United States
12d ago
-
Bayesian Modeling | Classical Test Theory | Cohen Kappa | Computational Linguistics | Data Pipelines
Senior-level
Full Time
Mountain View, CA, USA
24d ago