Senior LLM Evaluation Researcher - TikTok
San Jose, California, United States
USD 218K-389K (estimate) | Senior-level | Full Time
Tasks
- Analyze benchmark results
- Build automated LLM evaluation
- Conduct qualitative user research
- Conduct quantitative user research
- Define LLM evaluation framework
- Design evaluation programs
- Identify experience gaps
- Maintain evaluation datasets
- Recommend prioritized improvements
- Run benchmark assessments
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | Data Curation | Dataset Management | Language Models | Language Processing | Large Language Models | Machine Learning | Model Evaluation | Natural Language | Natural Language Processing | Qualitative research | Quantitative Research | User Research
Education
N/A