LLM Algorithm Engineer (Open-Domain Dialogue)
Shanghai
CNY 180K-300K (estimate) Mid-level Internship
Tasks
- Accelerate inference with vLLM
- Apply GRPO, PPO, and DPO optimization
- Build end-to-end dialogue data pipelines
- Clean and deduplicate raw corpora
- Develop LLM algorithms for open-domain dialogue systems
- Engineer prompts and tool-use strategies
- Improve intent recognition, personalization, and task-success metrics
- Improve multi-turn dialogue planning accuracy with agentic reinforcement learning
- Optimize foundation models with SFT and RLHF
- Quantize and distill models for faster inference
- Reduce hallucinations in agentic tool use
- Run offline evaluations and online A/B testing
- Support low-latency deployment on cloud or edge
- Track dialogue state with dialogue state tracking (DST) methods
- Build datasets for training reward models used in reinforcement learning
Skills/Tech-stack
DPO | Deep learning | DeepSpeed | Distributed Training | Function Calling | GRPO | Hallucination reduction | Knowledge Distillation | Language Models | Large Language Models | Model Quantization | OpenRLHF | PPO | Prompt engineering | Python | RLAIF | RLHF | ReAct | Reinforcement Learning | Reward Modeling | SFT | Thought Intermediate Result | Tool use | vLLM
Language: zh