Large Model Training and Inference Infra Engineer - MiMo
Tasks
- Analyze data loading and communication bottlenecks
- Build model inference frameworks
- Collaborate with research teams
- Develop distributed training infrastructure
- Develop monitoring and profiling tools
- Integrate high-performance computing technologies
- Maintain distributed training frameworks
- Optimize hardware utilization
- Optimize inference latency and memory
- Optimize model training performance
- Support model deployment for product teams
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | CUDA | cuDNN | Context caching | DeepSpeed | Docker | GPU | Horovod | JAX | Kubernetes | MPI | Mixed-precision training | Model Quantization | NCCL | NPU | Pruning | PyTorch | Python | Ray | TensorFlow | Terraform
Education