Senior AI Engineer
Beijing Yizhuang, China
CNY 240K-480K (estimate) | Senior-level | Full Time
Tasks
- Build fine-tuning pipeline templates
- Coordinate infrastructure requirements with science teams
- Create runbooks for failure handling and checkpoint recovery
- Define AI engineering standards
- Design distributed training infrastructure
- Establish MLOps practices: experiment tracking, model registry, and CI/CD
- Implement compute orchestration policies
- Optimize data loading pipelines
- Optimize training for GPU efficiency
- Set scheduling policies for GPU allocation
Perks/Benefits
- N/A
Skills/Tech-stack
Adapters | CI/CD | DDP | DeepSpeed | Distributed Training | Experiment tracking | FSDP | Fine Tuning | FlashAttention | Kubeflow | Kubernetes | LoRA | MLOps | Model Registry | PyTorch | QLoRA | Run | Slurm | Transformer
Education
N/A