算法工程师-大模型数据方向
Tasks
- Apply heuristic scoring and clustering to build golden dataset
- Build annotation tools to improve labeling throughput
- Build data quality evaluation metrics
- Build end to end data system for large language model
- Create corpus quality standards by source
- Design knowledge graph mapping and tagging framework
- Develop AI assisted data labeling workflow
- Develop data cleaning scripts for large scale corpora
- Guide dataset optimization using model feedback
- Implement data governance framework
- Mine and integrate general and domain corpora
- Process noise deduplicate and redact data
Perks/Benefits
- N/A
Skills/Tech-stack
Apache Spark | Clustering | Data Augmentation | Data Deduplication | Data Governance | Data cleaning | Data labeling | Data redaction | Heuristic Scoring | Knowledge graph | Language Processing | Multiprocessing | Natural Language | Natural Language Processing | Pandas | PyTorch | Python | Regular Expressions | TensorFlow | Transformer
Education
Related jobs
-
Senior-level Full Time上海、武汉、北京9h ago
-
数据开发工程师(Ai知识方向) CNY 180K-300KContent processing | Data Governance | ETL | Elasticsearch | Information ArchitectureFull-time employmentMid-level Full Time上海9h ago
-
Mid-level Full Time上海9h ago
-
Senior-level Full Time上海9h ago
-
Senior-level Full Time上海9h ago
-
大模型算法工程师(开放域对话) CNY 180K-300KA/B | A/B Testing | Agentic reinforcement learning | B testing | DeepSpeedMid-level Internship上海、北京9h ago
-
Mid-level Full Time上海9h ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KDistributed Training | Function Calling | GRPO | Human Feedback | JSONEntry-level Full Time上海、北京9h ago
-
Senior-level Full Time上海9h ago
-
Embedded Software Eng. CNY 180K-300KARM | ASPICE | Automotive Software | Automotive Software Development | C#Mid-level Full TimeWuhu, CN18h ago
-
AI/LLM Application Engineer CNY 280K-330KAPI | Access Control | Audit Logging | Authentication | AuthorizationMid-level Full TimeShenyang - PIC, China1d ago
-
AI/LLM Application Engineer CNY 280K-330KAccess Control | Audit Logging | Backend Development | Citation Generation | Document chunkingMid-level Full TimeShenyang - PIC, China1d ago
-
Senior-level Full TimeLOC3254: No.3239 Shenjiang Road, Shanghai, Pudong …1d ago
-
Sr. System Software Engineer CNY 240K-480KAAC | ARM | Audio/Video | Audio/Video Encoding | BashOn-site support | Remote support | Technical consulting | TrainingSenior-level Full TimeChina Shanghai1d ago
-
Senior Software Engineer - Robot Compute Platform CNY 240K-480KC# | C++ | CAN bus | CUDA | Deterministic systemsSenior-level Full TimeShanghai, China1d ago
-
Motion Control Engineer - Actuator Control Algorithms CNY 360K-600KAnti Windup | BLDC | Cogging Compensation | Commutation | Control loopSenior-level Full TimeShanghai, China1d ago
-
CI/CD | Docker | ETL | FastAPI | FlaskEntry-level InternshipShanghai, YANGPU, China1d ago
-
Senior Gen AI Software Solutions Engineer CNY 240K-360KAutogen | C++ | Deep learning | Edge AI | EmbeddingsOn-site work modelSenior-level Full TimeCHN - Minhang, China2d ago
-
优才-多模态交互算法工程师-X-Lab CNY 240K-480KAttention | Benchmarking | Computer Vision | Deep learning | Hard Negative MiningSenior-level Full Time上海、深圳2d ago
-
Mid-level Full Time深圳 R2d ago
-
Mid-level Full Time北京 R2d ago
-
Gaming AI Engineer CNY 304K-380KAlgorithms | Automatic Speech Recognition | C# | C++ | Computer ArchitectureMid-level Full TimeShenzhen, Guangdong, China2d ago
-
Forward Deployed AI Engineer CNY 72K-96KAWS | Agile | Amazon Redshift | BigQuery | Cloud platformTravel up to 50 percentEntry-level Full Time Internship北京2d ago
-
Mid-level Full Time北京 R2d ago
-
Mid-level Full Time Temporary北京2d ago