大模型算法工程师(开放域对话)
Tasks
- Accelerate model inference with quantization distillation and efficient frameworks
- Apply RLHF and RLAIF optimization
- Build end to end dialogue data pipelines
- Clean and deduplicate raw training data
- Collaborate on distributed training and deployment
- Develop LLM algorithms for open domain dialogue
- Improve multi turn dialogue decision making using agentic RL
- Mitigate hallucinations in agent tool use
- Optimize base model with SFT
- Optimize intent recognition and personalization performance
- Perform prompt engineering
- Run offline evaluations and online A B testing
- Track dialogue state using DST
- Train reward model datasets for reinforcement learning
Perks/Benefits
- N/A
Skills/Tech-stack
A/B | A/B Testing | Agentic reinforcement learning | B testing | DeepSpeed | DisTaTch State Tracking DST | Distributed Training | Function Calling | Group Relative Policy Optimization | Group Relative Policy Optimization GRPO | Knowledge Distillation | LLM | Language Models | Large Language Models | Online Experimentation | OpenRLHF | Prompt engineering | Proximal Policy Optimization | Proximal Policy Optimization PPO | Python | Quantization | RLAIF | RLHF | React | Reinforcement Learning | Reward Modeling | Supervised Fine Tuning | Supervised Fine-Tuning (SFT) | Thought Intermediate Result | VLLM
Education
Bachelor of Arts | Bachelor of Engineering | Bachelor of Science
Regions
Countries
States
Related jobs
-
Senior-level Full Time上海、武汉、北京7h ago
-
算法工程师-大模型数据方向 CNY 240K-360KApache Spark | Clustering | Data Augmentation | Data Deduplication | Data GovernanceSenior-level Full Time上海7h ago
-
数据开发工程师(Ai知识方向) CNY 180K-300KContent processing | Data Governance | ETL | Elasticsearch | Information ArchitectureFull-time employmentMid-level Full Time上海7h ago
-
Mid-level Full Time上海7h ago
-
Senior-level Full Time上海7h ago
-
Senior-level Full Time上海7h ago
-
Mid-level Full Time上海7h ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KDistributed Training | Function Calling | GRPO | Human Feedback | JSONEntry-level Full Time上海、北京7h ago
-
Senior-level Full Time上海7h ago
-
Embedded Software Eng. CNY 180K-300KARM | ASPICE | Automotive Software | Automotive Software Development | C#Mid-level Full TimeWuhu, CN16h ago
-
AI/LLM Application Engineer CNY 280K-330KAPI | Access Control | Audit Logging | Authentication | AuthorizationMid-level Full TimeShenyang - PIC, China1d ago
-
AI/LLM Application Engineer CNY 280K-330KAccess Control | Audit Logging | Backend Development | Citation Generation | Document chunkingMid-level Full TimeShenyang - PIC, China1d ago
-
Senior-level Full TimeLOC3254: No.3239 Shenjiang Road, Shanghai, Pudong …1d ago
-
Simulation Engineer, Industrial Physics and Robotics CNY 360K-600KCUDA | Co-simulation | Contact mechanics | Controller co simulation | Deformable BodiesComprehensive benefits package | Mentorship | Supportive work environmentSenior-level Full TimeChina, Shanghai1d ago
-
Sr. System Software Engineer CNY 240K-480KAAC | ARM | Audio/Video | Audio/Video Encoding | BashOn-site support | Remote support | Technical consulting | TrainingSenior-level Full TimeChina Shanghai1d ago
-
Senior Software Engineer - Robot Compute Platform CNY 240K-480KC# | C++ | CAN bus | CUDA | Deterministic systemsSenior-level Full TimeShanghai, China1d ago
-
Motion Control Engineer - Actuator Control Algorithms CNY 360K-600KAnti Windup | BLDC | Cogging Compensation | Commutation | Control loopSenior-level Full TimeShanghai, China1d ago
-
Robotics Lead/ System Architect CNY 360K-600KActuation | CAD | Computer Architecture | Embedded Systems | Humanoid roboticsSenior-level Full TimeShanghai, China1d ago
-
CI/CD | Docker | ETL | FastAPI | FlaskEntry-level InternshipShanghai, YANGPU, China1d ago
-
Senior Gen AI Software Solutions Engineer CNY 240K-360KAutogen | C++ | Deep learning | Edge AI | EmbeddingsOn-site work modelSenior-level Full TimeCHN - Minhang, China2d ago
-
优才-多模态交互算法工程师-X-Lab CNY 240K-480KAttention | Benchmarking | Computer Vision | Deep learning | Hard Negative MiningSenior-level Full Time上海、深圳2d ago
-
Mid-level Full Time深圳 R2d ago
-
Mid-level Full Time北京 R2d ago
-
Robotic Embodied AI Engineer CNY 300K-355KAction Transformers | Action models | Autonomous Navigation | Computer Vision | Deep learningMid-level Full TimeBeijing, Beijing, China2d ago
-
Gaming AI Engineer CNY 304K-380KAlgorithms | Automatic Speech Recognition | C# | C++ | Computer ArchitectureMid-level Full TimeShenzhen, Guangdong, China2d ago