MiMo - Large Model Training Framework Development Engineer
Tasks
- Accelerate data loading and preprocessing
- Build performance monitoring and alerting
- Conduct performance evaluation and tuning
- Develop deep learning training framework
- Enable communication-computation overlap
- Identify performance bottlenecks
- Improve memory and GPU utilization
- Optimize distributed parallelism strategies
- Optimize the efficiency, stability, and scalability of large-model training
- Solve distributed training communication issues
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | CI/CD | CUDA | Data loading | DeepSpeed | Distributed Training | InfiniBand | Megatron-LM | NCCL | Nsight | Performance Monitoring | Profiling | PyTorch | Python | RoCEv2
Education
- N/A