Algorithm Engineer, Large Model Data
Tasks
- Build an end-to-end data governance framework
- Build gold datasets using heuristics, model scoring, and clustering
- Construct data quality evaluation metrics
- Create a feedback loop spanning data production, evaluation, and iteration
- Design knowledge taxonomy for large model data assets
- Develop an AI-assisted labeling pipeline
- Develop cleaning scripts with rules and quality standards
- Handle noise, deduplication, and data anonymization
- Implement data cleaning for large corpora
- Map the knowledge graph to labels
- Optimize labeling workflow and throughput
Skills/Tech-stack
Automated Evaluation | Clustering | Corpus Synthesis | Data Augmentation | Data Deduplication | Data Governance | Data Anonymization | Data Cleaning | Data Labeling | Dataset Evaluation | Heuristic Rules | Knowledge Graph | Multiprocessing | Natural Language Processing | Pandas | PySpark | PyTorch | Python | Regular Expressions | Spark | TensorFlow | Transformers
Education
Bachelor of Engineering | Bachelor of Science | Master of Science