大模型训练与推理Infra工程师-MiMo
Tasks
- Analyze and optimize data loading, communication, and hardware utilization
- Build inference framework
- Collaborate with research teams on training and inference strategies
- Develop distributed training platform
- Develop monitoring and profiling tools
- Implement inference optimization techniques
- Integrate high-performance computing technologies
- Maintain distributed training framework
- Optimize inference speed memory and energy
- Support model deployment for product teams
Perks/Benefits
- N/A
Skills/Tech-stack
C plus plus | CUDA | CUDNN | DeepSpeed | Docker | GPU | Horovod | JAX | Kubernetes | MPI | Mixed Precision | Mixed-precision training | Model Pruning | Model Quantization | NCCL | NPU | PyTorch | Python | Ray | TensorFlow | Terraform | Transformer
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Related jobs
-
Entry-level Full Time北京 R9h ago
-
Entry-level Full Time北京 R10h ago
-
Mid-level Full Time北京 R10h ago
-
具身智能算法工程师-模型 CNY 500K-500KActor-critic | Deep learning | Distributed Training | Implicit Q Learning | Inference accelerationMid-level Full Time北京 R10h ago
-
AI基础设施研发工程师(Sandbox / 容器化)-MiMo CNY 180K-420KAppArmor | Argo Workflows | CPU resource scheduling | Cgroup | ContainerdMid-level Full Time北京 R10h ago
-
JMP_ AI Operation Excellence Expert(VM) CNY 240K-480KAI Agents | API | Cloud Native | Data Governance | Digital TwinSenior-level Full TimeSuzhou, Jiangsu, China R3d ago
-
Lead Embedded Software Engineer CNY 349K-437KARM | BLE | C# | C++ | Embedded LinuxHybrid work model | Remote-friendly | Work from homeSenior-level Full TimeSuzhou, China R5d ago
-
Mid-level Full Time上海、深圳 R10d ago
-
模型部署与推理优化工程师 CNY 180K-360KC++ | Edge inference | Inference Performance | Inference Performance Optimization | Model DistillationMid-level Full Time北京 R17d ago
-
Entry-level Internship上海 R17d ago
-
Ai系统软件实习生 CNY 37K-37KAgent Development | C++ | GPU Computing | HPC | High PerformanceFlexible schedule | Remote workEntry-level Internship上海 R1mo ago
-
Mid-level Full Time北京 R1mo ago
-
AWS | Azure | JavaScript | NoSQL | Node.jsFast-paced environment | Remote workMid-level Full TimeHangzhou R1mo ago
-
AWS | Agile | Azure | Blockchain | CursorRemote workMid-level Full TimeShenzhen R1mo ago
-
A/B | A/B Testing | AWS | B testing | Cohort AnalysisComprehensive benefits package | Flexible work model | Work from home flexibilityMid-level Full TimeShanghai, China R1mo ago