大模型算法工程师(开放域对话)
Tasks
- Accelerate inference with quantization distillation and efficient serving
- Apply GRPO PPO and DPO for policy optimization
- Build end to end dialogue data pipelines
- Clean and deduplicate raw corpora
- Collaborate with engineering for model deployment on edge and cloud
- Create evaluation datasets for interactive scenarios
- Develop LLM algorithms for open domain dialogue
- Fine tune base models with SFT
- Implement multi turn dialogue state tracking
- Improve multi turn decision planning accuracy
- Optimize intent recognition and personalization metrics
- Optimize models with RLHF and RLAIF
- Perform prompt engineering for better tool use
- Reduce hallucinations in agent tool use
- Run offline evaluation and online A B testing
- Train reward models for reinforcement learning
Perks/Benefits
Skills/Tech-stack
AI Feedback | Agentic tool use | DPO | DST | DeepSpeed | Dialogue State Tracking | Distributed Training | Function Calling | GRPO | Human Feedback | Inference acceleration | LLM | Language Models | Large Language Models | Learning from Human Feedback | Model Distillation | Model Quantization | Multi Turn Dialogue State Tracking | Multi-turn dialogue | PPO | Prompt engineering | Python | RLAIF | RLHF | React | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Reward Modeling | SFT | State tracking | Thought Intermediate Result | Tool use | VLLM
Education
Related jobs
-
大模型算法工程师-c端方向 CNY 240K-480KChain-of-Thought | Deep search | LLM Inference | LLM Training | Language ModelsMid-level Full Time北京6h ago
-
Entry-level Internship上海6h ago
-
Mid-level Full Time北京6h ago
-
【校招储备】算法实习生 CNY 25K-37KAgent Frameworks | Algorithms | Auto Tool Calling | Autogen | Data StructuresEntry-level Internship Temporary上海7h ago
-
Forward Deployed Architect, Generative AI, Google Cloud CNY 435K-500KAPIs | CI/CD | Data Pipelines | Data Sovereignty | ExperimentationSenior-level Full TimeBeijing, China; Shanghai, China13h ago
-
Senior-level Full TimeShanghai, CHN1d ago
-
Manager, AI Backend Developer CNY 240K-360KAI Agents | Inference acceleration | Linux | Machine Learning | MathematicsMid-level Full TimeAIA ED (Shanghai) Hongkou, China1d ago
-
Workload optimization intern CNY 38K-50KAgents | C++ | CUDA | Deep learning | GPU Kernel DevelopmentFlexible internship schedule | On-site workEntry-level Full Time InternshipCHN - Minhang, China1d ago
-
AI Software Engineer Intern CNY 38K-50KAgent Development | Computer Vision | Deep learning | Fine Tuning | Inference OptimizationCareer development opportunities | Collaborative environment | On site work experienceEntry-level Full Time InternshipCHN - Minhang, China1d ago
-
Data Analytics & Management, Officer CNY 330K-520KAgentic AI | Alteryx | Anomaly Detection | Anthropic Claude | ClassificationEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysSenior-level Full TimeHangzhou, China1d ago
-
Data Analytics & Management, Officer CNY 330K-520KAccounting Constraints | Anomaly Detection | Classification | Code platforms | Data GovernanceEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysSenior-level Full TimeHangzhou, China1d ago
-
Data Analytics & Management, Officer CNY 330K-520KAgentic AI | Alteryx | Anomaly Detection | Anthropic Claude | BCBS 239Employee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysSenior-level Full TimeHangzhou, China1d ago
-
Anomaly Detection | BCBS 239 | Classification | Code platforms | Data GovernanceEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysExecutive-level Full TimeHangzhou, China1d ago
-
Data Analytics & Management, Officer CNY 330K-520KAgentic AI | Alteryx | Anomaly Detection | Classification | Copilot StudioEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysSenior-level Full TimeHangzhou, China1d ago
-
Alteryx | Anomaly Detection | Code Automation | Code Development | Copilot StudioEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer days | Vibrant employee networksExecutive-level Full TimeHangzhou, China1d ago
-
A/B | A/B Testing | Agentic Workflows | B testing | Fine TuningCareer development | Coaching and mentoringEntry-level Internship Part TimeShanghai - Phincas 2, China1d ago
-
Entry-level Internship合肥1d ago
-
多模态大模型数据算法实习生 CNY 25K-37KComputer Vision | Data Augmentation | Data Processing | Data Quality | Deep learningInternshipEntry-level Internship北京1d ago
-
Entry-level Internship上海1d ago
-
Senior-level Full Time上海、苏州、北京、广州、深圳1d ago
-
Mid-level Full Time北京、上海、苏州1d ago
-
Senior-level Full Time上海、苏州、北京、深圳1d ago
-
Deep Learning Planning R&D Engineer CNY 180K-300KC++ | Data Structures | Data Structures and Algorithms | Deep learning | Deep reinforcement learningMid-level Full Time上海、苏州、北京、深圳1d ago
-
Mid-level Full Time北京、深圳、苏州、上海1d ago
-
Senior-level Full Time北京、上海1d ago