Large Model Training and Inference Infra Engineer - MiMo
Tasks
- Analyze and optimize data loading, communication, and hardware utilization
- Build model inference framework
- Collaborate with research teams on training and inference strategy
- Develop distributed model training platform
- Develop performance monitoring and analysis tools
- Implement inference optimization techniques
- Integrate high-performance computing technologies
- Maintain distributed training frameworks
- Optimize inference speed, memory usage, and energy consumption
- Support the product team with model deployment
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | CUDA | cuDNN | DeepSpeed | Docker | GPU | Horovod | JAX | Kubernetes | MPI | Mixed-precision training | NCCL | NPU | Pruning | PyTorch | Python | Quantization | Ray | TensorFlow | Terraform | Transformer
Education