Ai数据工程实习生(训练数据 & 清洗方向)
Tasks
- Build denoising data processing
- Coordinate data collection, cleaning, labeling
- Create data quality evaluation system
- Define annotation guidelines
- Design training data pipeline
- Develop structured sample extraction
- Handle edge and failure scenarios
- Implement data deidentification
- Iterate data strategy with model feedback
- Optimize large scale data processing pipeline
- Support data flywheel system
- Validate and standardize data format
Perks/Benefits
- N/A
Skills/Tech-stack
Code Data Processing | Data Deidentification | Data Processing | Data Quality | Data Quality Evaluation | Data Standardization | Data Validation | Data cleaning | Data labeling | Data pipeline | Instruction Tuning | LLM | Large Scale Data | Large-scale | Large-scale Data Processing | Python | Quality evaluation | Trajectory Data
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Related jobs
-
Entry-level Internship深圳22h ago
-
AI Agent 开发实习生(通用智能仿真方向) CNY 25K-37KAI Agent | API Integration | Asynchronous programming | Autogen | C++Flexible learning | Internship | MentorshipEntry-level Internship广州22h ago
-
3D Gaussian Splatting | 3D Geometry | 3D Object Detection | Algorithms | Autoregressive GenerationMid-level Full Time北京、上海、苏州23h ago
-
Forward Deployed Architect, Generative AI, Google Cloud CNY 360K-600KCI/CD | Cloud platform | Data Pipelines | Data Sovereignty | ExperimentationSenior-level Full TimeShenzhen, Guangdong Province, China; Shanghai, China1d ago
-
Entry-level Internship广州1d ago
-
Robotaxi VLA 大模型算法实习生 CNY 25K-37KAutonomous Driving | C++ | Case Development | Computer Vision | Data cleaningEntry-level Internship广州1d ago
-
Entry-level Full Time北京、上海 R1d ago
-
Robotaxi VLA 大模型算法实习生 CNY 25K-37KAutonomous Driving | C++ | Data labeling | Fine Tuning | Functional SafetyEntry-level Internship广州1d ago
-
Senior-level Full Time上海1d ago
-
Associate Director, Data and Analytics CNY 240K-360KAutomation | Batch Processing | BigQuery | CI/CD | Cloud StorageMid-level Full TimeGuangzhou, Guangdong, China2d ago
-
Deep Learning Performance Architect, CUTLASS DSL Testing CNY 360K-600KAutomated testing | Code Coverage | GPU Computing | MLIR | PythonSenior-level Full TimeChina, Shanghai2d ago
-
Mid-level Full TimeChina, Shanghai2d ago
-
C# | C++ | Debugging | Deep learning | Generative AISenior-level Full TimeChina, Shanghai2d ago
-
Mid-level Full TimeShenzhen, Guangdong, China2d ago
-
校招-机器人感知算法开发工程师(目标检测方向) CNY 240K-360K3D Reconstruction | C++ | Camera Calibration | Cloud processing | Coordinate TransformationNone Full Time上海、合肥、北京2d ago
-
Mid-level Full Time广州2d ago
-
AI Feedback | Deep learning | Human Feedback | Language Models | Language ProcessingMid-level Full Time上海2d ago
-
Senior-level Full Time上海、武汉、北京2d ago
-
算法工程师-大模型数据方向 CNY 240K-480KAutomated Evaluation | Clustering | Corpus Synthesis | Data Augmentation | Data DeduplicationFull time remote N/ASenior-level Full Time上海2d ago
-
数据开发工程师(Ai知识方向) CNY 180K-300KAlgorithm | Data Governance | ETL | Elasticsearch | Information ArchitectureMid-level Full Time上海2d ago
-
Mid-level Full Time上海2d ago
-
Ai 应用研发工程师(上海) CNY 240K-480KAgent | Alerting | Concurrency | Cost Optimization | Deployment pipelineSenior-level Full Time上海2d ago
-
大模型算法工程师(开放域对话) CNY 180K-300KA/B | A/B Testing | B testing | DPO | DeepSpeedInternship opportunityMid-level Internship上海2d ago
-
Mid-level Full Time上海2d ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KAgentic RL | Distributed Training | Function Calling | GRPO | JavaEntry-level Full Time上海、北京2d ago