Algorithm Engineer, Large Model Data
Tasks
- Build annotation support tools to improve throughput
- Build an end-to-end model data pipeline
- Build a golden corpus using clustering and scoring
- Close the loop by using model feedback to iterate on datasets
- Construct data quality evaluation metrics
- Create a multidimensional labeling system
- Define corpus classification standards
- Design a model knowledge framework
- Develop an AI-assisted annotation workflow
- Develop a data governance framework
- Implement corpus cleaning scripts
- Map knowledge graph features
- Perform noise removal, deduplication, and desensitization
- Set source-specific quality standards
Perks/Benefits
- N/A
Skills/Tech-stack
Apache Spark | Clustering | Data Annotation | Data Annotation Automation | Data Governance | Data Quality | Data Quality Evaluation | Data cleaning | Data labeling | Fine Tuning | Knowledge graphs | Language Processing | Natural Language | Natural Language Processing | Pandas | Pretraining | PyTorch | Python | Quality evaluation | Regular Expressions | TensorFlow | Transformer
Education
Bachelor of Arts | Bachelor of Engineering | Bachelor of Science