AI Lab - LLM Applied Evaluation and Benchmark Intern
Beijing, Beijing, China
CNY 25K-37K (estimate) Entry-level Full Time Internship
Tasks
- Analyze hallucinations
- Analyze logical inconsistencies
- Assess multi step reasoning
- Build benchmarking datasets
- Conduct automated evaluation
- Conduct human evaluation
- Define evaluation metrics
- Design LLM evaluation framework
- Detect bias issues
- Evaluate model outputs
- Execute LLM application testing
- Perform data anonymization
- Perform data cleaning
- Perform data masking
- Present evaluation findings
- Validate AI agent applications
- Write evaluation reports
Perks/Benefits
- N/A
Skills/Tech-stack
Agent Frameworks | Anonymization | Boundary testing | Case design | Data Privacy | Data masking | LLM-as-a-Judge | Langchain | Language Models | Large Language Models | Llamaindex | OpenAI API | Prompt engineering | Python | RAG | SQL | Software testing | Test Case | Test Case Design
Education
N/A
Related jobs
-
大模型算法工程师--c端方向 CNY 240K-480KChain-of-Thought | Deep search | Information Retrieval | LLM Inference | LLM TrainingMid-level Full Time北京1d ago
-
AI Intern – RAG Engineering CNY 37K-44KContext Construction | Dify | Document parsing | Information Retrieval | LangchainEntry-level Full Time Internship北京市, 北京市, 中国1d ago
-
AI Intern – Agent & LLM Solutions CNY 37K-48KDify | Langchain | Langgraph | Language Models | Large Language ModelsEntry-level Full Time Internship北京市, 北京市, 中国1d ago
-
ANSYS | APDL | AVL Excite | C# | Durability analysisSenior-level Full TimeWuhan, Hubei, China1d ago
-
Machine Learning Engineer, Community Support Engineering CNY 240K-360KAgentic AI | Artificial Intelligence | Chatbots | Evaluation | Feedback learningOccasional office work | Remote eligibleMid-level Full TimeChina1d ago
-
Artificial Intelligence & Machine Learning, Off CNY 240K-360KAttribution Analysis | Classification | Data Governance | Data Modeling | EconometricsEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysMid-level Full TimeHangzhou, China1d ago
-
A/B | A/B Testing | AWS | B testing | Cohort AnalysisComprehensive benefits package | Flexible work model | Work from home flexibilityMid-level Full TimeShanghai, China R1d ago
-
ASIC Design Flow | ASIC design | C plus plus | Design flow | Low powerMid-level Full TimeChina, Shanghai1d ago
-
大模型算法实习生 CNY 36K-37KDeep learning | DeepSpeed | Distributed Training | GPU Training | JavaLarge scale text data access | Stable internship opportunity | Supportive team environment | Technical mentorshipEntry-level Internship北京、上海2d ago
-
大模型算法-校招 CNY 500K-500KDeep learning | DeepSpeed | Distributed Training | GPU Training | Information ExtractionLarge-scale datasets | NLP application projects | Relaxed team atmosphere | Technical mentorshipEntry-level Full Time上海、北京2d ago
-
Entry-level Full Time北京2d ago
-
高级Ai系统开发工程师(大模型与Rag方向) CNY 240K-480KAgent workflow | Caching | Distributed Systems | Dynamic batching | ElasticsearchSenior-level Full Time武汉2d ago
-
Senior-level Full Time北京2d ago
-
Senior-level Full TimeChina3d ago
-
Research Intern (AI Agent) CNY 25K-37KAgent systems | Embodied AI | Language Models | Large Language Models | Memory-augmented systemsEntry-level Full Time Internship深圳4d ago
-
机器人多模态数据工程实习生 CNY 25K-37KData Engineering | Data Processing | Data Storage | Data Visualization | Data alignmentCross-disciplinary collaboration | Hands on robotics projects | Research opportunityEntry-level Internship深圳4d ago
-
具身智能算法实习生 (Manipulation) CNY 25K-37KCLIP | Computer Vision | Deep learning | Diffusion Model | Fine TuningEntry-level Internship深圳4d ago
-
Agi 后端工程师-下一代Ai数据链路 CNY 180K-360KCache | Data Processing | Data Storage | Data cleaning | Data pipelineEntry-level Full Time北京、上海4d ago
-
校招-Ai研究科学家-大语言模型/视觉语言模型算法与后训练(博士优先) CNY 500K-500KAdapters | Direct Preference Optimization | Fine Tuning | Flax | Function designNone Full Time上海4d ago
-
Activation Function | Architecture Design | Automated testing | CI/CD | Computer VisionBirthday off | Flexible working hours | Local holidays | Onsite work | Paid vacationMid-level Full TimeShenzhen4d ago
-
Entry-level Internship Part TimeChina4d ago
-
Data Scientist CNY 216K-296KData Aggregation | Econometrics | Python | R | SQLEmployee Assistance Program (EAP) | Flexible working environment | LinkedIn Learning | Volunteer time offMid-level Full TimeShanghai, SH, China4d ago
-
AI & Data Science Intern CNY 38K-50KAgentic AI | Data Pipelines | Data Visualization | Embeddings | Generative AICross-functional collaboration | Hands-on projects | Innovation at scale | MentorshipEntry-level Full Time InternshipSHANGHAI, China4d ago
-
Senior GenAI Software Architect CNY 240K-480KAutogen | Bayesian analysis | Chroma | Deep learning | Edge ComputingOn-site workSenior-level Full TimeCHN - Minhang, China4d ago
-
Senior GenAI Software Architect CNY 240K-480KAutogen | Bayesian analysis | Chroma | Deep learning | Edge AIOn-site work modelSenior-level Full TimeCHN - Minhang, China4d ago