算法工程师-大模型数据方向
Tasks
- Apply data desensitization
- Build data quality evaluation metrics
- Build end to end data pipeline
- Build gold corpus dataset
- Clean raw corpus data
- Construct label system
- Create data feedback loop
- Design knowledge taxonomy
- Develop AI assisted labeling tools
- Develop data governance framework
- Optimize annotation workflow
- Perform data scoring
- Remove duplicates
- Run clustering analysis
- Write data cleaning scripts
Perks/Benefits
- N/A
Skills/Tech-stack
Annotation Automation | Apache Spark | Clustering | Data Annotation | Data Governance | Data Pipelines | Data Quality | Data Quality Evaluation | Data cleaning | Data desensitization | Data scoring | Heuristic rules | Knowledge graphs | Language Processing | Multiprocessing | Natural Language | Natural Language Processing | Pandas | PyTorch | Python | Quality evaluation | Regular Expressions | TensorFlow | Transformer
Education
Roles
AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer
Related jobs
-
AVP - Data and Analytics CNY 300K-420KAI Governance | Algorithmic Compliance | Audit evidence | Behavioral analytics | Cause analysisFlexible working | Professional developmentExecutive-level Full TimeGuangzhou, Guangdong, China1d ago
-
ANSYS APDL | C Programming | Design of Experiments | Durability analysis | Element analysisMid-level Full TimeWuhan, Hubei, China1d ago
-
Ai算法工程师 CNY 144K-240KBig Data | Big data processing | Data Processing | Deep learning | Feature EngineeringEntry-level Full Time深圳1d ago
-
【校招实习】Ai算法工程师 CNY 25K-37KComputer Vision | Data Analysis | Deep learning | Feature Engineering | HadoopInternship opportunityEntry-level Internship深圳1d ago
-
Entry-level Internship深圳1d ago
-
Mid-level Full Time北京 R1d ago
-
大模型算法研究员-MiMo CNY 500K-500KAI Feedback | Active Learning | C plus plus | Curriculum learning | Deep learningMid-level Full Time北京1d ago
-
Mid-level Full Time武汉1d ago
-
Entry-level Full Time北京 R1d ago
-
Ai数据闭环研发工程师 CNY 240K-360KData Distribution | Data Distribution Strategy | Data Flywheel | Data Mining | Data evaluationSenior-level Full Time上海、北京1d ago
-
Mid-level Full Time上海1d ago
-
Senior-level Full Time上海、武汉、北京1d ago
-
Mid-level Full Time上海1d ago
-
Mid-level Full Time上海1d ago
-
数据智能团队负责人 CNY 240K-360KAnomaly Detection | ClickHouse | Data Governance | Data Modeling | Data QualitySenior-level Full Time上海1d ago
-
Senior-level Full Time上海1d ago
-
Mid-level Internship上海、北京1d ago
-
Mid-level Full Time上海1d ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KAgentic RL | DAPO | Distributed Training | Evaluation Frameworks | Function CallingEntry-level Full Time上海、北京1d ago
-
Senior-level Full Time上海1d ago
-
[NCA and TW Ads] Senior Staff Machine Learning Engineer CNY 180K-300KContextual bandit | DIN | Deep Interest Network | Deep learning | Distributed SystemsSenior-level Full TimeShanghai, China1d ago
-
Manager, AI / Data Scientist CNY 300K-380KClustering | Convolutional Neural Network | Data Analysis | Data Preparation | Data QualityMid-level Full TimeAIA AI Tech ED (Shanghai) Hongkou, …2d ago
-
Action models | C++ | Data Generation | Dataset curation | Deep learningSenior-level Full TimeChina, Shanghai2d ago
-
Infrastructure Maintenance CNY 60K-60KArtificial Intelligence | ChatGPT | Data Analysis | Language Processing | Machine LearningEntry-level Full TimeChaoyang, BJ, CN2d ago
-
Data Analytics Engineer CNY 300K-420KAccuracy | Artificial Intelligence | Citation | Docker | ElasticsearchNo business travel required | No overtime requirement | Standard working hoursEntry-level Full TimeShanghai, SH, CN, 2000312d ago