LLM Post-Training Algorithm Engineer
Tasks
- Analyze bottlenecks in instruction following and QA
- Apply RLHF and RLAIF methods
- Apply SFT methods
- Automate training data construction and synthesis
- Design and implement LLM post-training pipelines
- Develop an online data flywheel for continuous iteration
- Optimize model algorithms for quality and stability
Perks/Benefits
- N/A
Skills/Tech-stack
AI Feedback | Deep learning | Human Feedback | Language Models | Language Processing | Large Language Models | Learning from Human Feedback | Machine Learning | Mixture of Experts | Natural Language | Natural Language Processing | PyTorch | Python | RLAIF | RLHF | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | SFT | Transformer
Education