大模型算法工程师(开放域对话)
上海
CNY 180K-300K (estimate) Mid-level Internship
Tasks
- Accelerate inference with vLLM
- Apply GRPO PPO and DPO optimization
- Build end to end dialogue data pipelines
- Clean and deduplicate raw corpora
- Develop LLM algorithms for open domain dialogue systems
- Engineer prompts and tool use strategies
- Improve intent recognition personalization and task success metrics
- Improve multi turn dialogue planning accuracy with agentic reinforcement learning
- Optimize foundation models with SFT and RLHF
- Quantize and distill models for faster inference
- Reduce hallucinations in agentic tool use
- Run offline evaluations and online A B testing
- Support low latency deployment on cloud or edge
- Track dialogue state with DST methods
- Train reward model datasets for reinforcement learning
Perks/Benefits
Skills/Tech-stack
DPO | Deep learning | DeepSpeed | Distributed Training | Function Calling | GRPO | Hallucination reduction | Knowledge Distillation | Language Models | Large Language Models | Model Quantization | OpenRLHF | PPO | Prompt engineering | Python | RLAIF | RLHF | React | Reinforcement Learning | Reward Modeling | SFT | Thought Intermediate Result | Tool use | VLLM
Education
Language: zh
Views:
9
Clicks:
1
Saves: 0
Related jobs
-
Ai算法工程师 CNY 144K-240KBig Data | Big data processing | Data Processing | Deep learning | Feature EngineeringEntry-level Full Time深圳11h ago
-
【校招实习】Ai算法工程师 CNY 25K-37KComputer Vision | Data Analysis | Deep learning | Feature Engineering | HadoopInternship opportunityEntry-level Internship深圳12h ago
-
Entry-level Internship深圳12h ago
-
Mid-level Full Time北京 R13h ago
-
Entry-level Full Time北京 R14h ago
-
Ai数据闭环研发工程师 CNY 240K-360KData Distribution | Data Distribution Strategy | Data Flywheel | Data Mining | Data evaluationSenior-level Full Time上海、北京16h ago
-
Mid-level Full Time上海16h ago
-
Mid-level Full Time上海16h ago
-
Mid-level Full Time上海16h ago
-
Senior-level Full Time上海17h ago
-
Mid-level Internship上海、北京17h ago
-
[NCA and TW Ads] Senior Staff Machine Learning Engineer CNY 180K-300KContextual bandit | DIN | Deep Interest Network | Deep learning | Distributed SystemsSenior-level Full TimeShanghai, China21h ago
-
Action models | C++ | Data Generation | Dataset curation | Deep learningSenior-level Full TimeChina, Shanghai1d ago
-
Infrastructure Maintenance CNY 60K-60KArtificial Intelligence | ChatGPT | Data Analysis | Language Processing | Machine LearningEntry-level Full TimeChaoyang, BJ, CN1d ago
-
AI运维工程师(大模型推理 / AI Infra) CNY 180K-300KAlerting | Automation | Docker | GPU Acceleration | High AvailabilityEntry-level Full Time深圳1d ago
-
数据算法工程师 CNY 180K-300KAnomaly Detection | Automation | C plus plus | Computer Vision | Data AnnotationEntry-level Full Time上海1d ago
-
Entry-level Full Time上海1d ago
-
Entry-level Full Time上海1d ago
-
Entry-level Full Time上海1d ago
-
Mid-level Full TimeSuzhou, Jiangsu, China2d ago
-
Principal Specialist, AI Engineer CNY 360K-600KArtificial Intelligence | Deep learning | End to End | End to End AI | Language ProcessingSenior-level Full TimeCN-OCG International Center, Cheng Du, China2d ago
-
2026 Intern(3 months)-AI Software Enginner CNY 38K-50KAlgorithms | Android | Audio Video Decoding | Audio/Video | C#Entry-level InternshipShenzhen, Guangdong, China2d ago
-
Machine Learning Engineer CNY 248K-315KAndroid | C# | C++ | Embedded System | Embedded System ArchitectureMid-level Full TimeShanghai, Shanghai, China2d ago
-
Entry-level Full TimeShanghai, Shanghai, China2d ago
-
Machine Learning Engineer CNY 216K-300KAI acceleration | Android | C++ | Concurrency optimization | Embedded DevelopmentMid-level Full TimeShenzhen, Guangdong, China2d ago