大语言模型后训练算法工程师
Tasks
- Build and optimize model evaluation metrics
- Collaborate with data and product teams
- Define delivery standards and collect feedback
- Deploy and optimize inference services
- Design LLM reinforcement learning experiments
- Design LLM supervised fine tuning experiments
- Improve training framework stability
- Optimize cloud training pipeline efficiency
- Optimize data loading and resource scheduling
- Optimize distributed training communication
- Optimize model accuracy safety and consistency
Perks/Benefits
- N/A
Skills/Tech-stack
Distributed Training | Docker | Fine Tuning | GPU Training | Kubernetes | LLM Inference | Language Models | Large Language Models | Model Evaluation | Multi-GPU | Multi-GPU Training | PyTorch | Python | Ray Serve | Reinforcement Learning | SGLang | Supervised Fine Tuning | VLLM
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Roles
AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer
Regions
Countries
States
Related jobs
-
Mid-level Full Time北京 R15h ago
-
大模型算法研究员-MiMo CNY 500K-500KActive Learning | C++ | Curriculum learning | Data Generation | Deep learningEntry-level Full Time北京15h ago
-
Miclaw-端云协同调度专家 (Hybrid AI Architect) CNY 240K-480K5G | Cloud API | Consistency protocols | Data Compression | Data PrivacyHybrid workSenior-level Full Time北京 R15h ago
-
Entry-level Full Time北京 R17h ago
-
高级算法工程师(Nlp方向) CNY 240K-480KAgent Development | Agent development tools | Agent memory | CUDA | ChromaSenior-level Full Time北京17h ago
-
Entry-level Full Time北京 R17h ago
-
Ai数据产品经理 CNY 240K-420KAgent Orchestration | Context Completion | Data Warehouse | Dimensional Modeling | ETLMid-level Full Time北京17h ago
-
Mid-level Full Time北京 R17h ago
-
具身世界模型训练INFRA工程师 - XiaomiRobotics CNY 180K-360KDeep learning | DeepSpeed | Distributed Training | Fault Tolerance | KubernetesMid-level Full Time北京17h ago
-
具身智能算法工程师-模型 CNY 500K-500KActor-critic | Deep learning | Distributed Training | Implicit Q Learning | Inference accelerationMid-level Full Time北京 R17h ago
-
Entry-level Full Time北京17h ago
-
AI基础设施研发工程师(Sandbox / 容器化)-MiMo CNY 180K-420KAppArmor | Argo Workflows | CPU resource scheduling | Cgroup | ContainerdMid-level Full Time北京 R17h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitEntry-level Internship深圳18h ago
-
Ai应用工程师(提效方向 0-1) CNY 50K-50KAI Programming | AI Programming Tools | API Integration | JavaScript | Language ProcessingEngineering resource support | Hands-on product development | Model and compute support | Real world usageEntry-level Internship深圳18h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data Retrieval | Data StorageEntry-level Internship深圳18h ago
-
Entry-level Full Time北京18h ago
-
Entry-level Internship北京18h ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KAgentic RL | DAPO | Distributed Training | Function Calling | GRPOEntry-level Full Time上海、北京18h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data pipeline | Distributed SystemsEntry-level Internship深圳18h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitFlexible work schedule | Internship opportunity | MentorshipEntry-level Internship深圳18h ago
-
Entry-level Full Time北京、上海19h ago
-
AGI 服务端资深工程师-Talkie&星野 CNY 180K-300KData Engineering | Dify | Distributed Systems | Go | Inference OptimizationMid-level Full Time北京、上海19h ago
-
Mid-level Full TimeBeijing, China1d ago
-
Mid-level Full TimeChina Shanghai1d ago
-
Asynchronous programming | Dashboards | Data Observability | Data Validation | DatabasesMid-level Full TimeChina, Shanghai1d ago