MiMo Large Model Training Framework Development Engineer
Tasks
- Accelerate data loading and preprocessing
- Build performance monitoring system
- Design training framework
- Develop and optimize large model training pipeline
- Diagnose training bottlenecks
- Improve memory and GPU memory efficiency
- Manage CI/CD workflows
- Optimize distributed training communication
- Perform performance evaluation and tuning
- Tune parallelism strategy
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | CI/CD | DP | Data Preprocessing | Data Loading | DeepSpeed | Distributed Training | EP | InfiniBand | Megatron-LM | Memory Optimization | NCCL | Nsight | PP | Performance Monitoring | PyTorch | Python | RoCEv2 | TP
Education
N/A