大模型训练与推理Infra工程师-MiMo
Tasks
- Analyze and optimize data loading, communication, and hardware utilization
- Build online and offline model inference framework
- Collaborate with model research teams
- Develop distributed training compute platform
- Develop monitoring and profiling toolchain
- Implement inference optimization techniques
- Integrate high-performance computing technologies
- Maintain distributed training frameworks
- Optimize inference speed memory and energy
- Support model deployment for product teams
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | CUDA | CUDNN | Context caching | DeepSpeed | Distributed Training | Docker | GPU | Horovod | JAX | Kubernetes | MPI | Mixed Precision | Mixed-precision training | Model Quantization | NCCL | NPU | Pruning | PyTorch | Python | Ray | TensorFlow | Terraform
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Related jobs
-
Mid-level Full Time上海2h ago
-
Mid-level Full Time深圳、上海2h ago
-
Mid-level Full Time北京2h ago
-
机器人大模型软件工程师 CNY 180K-300KAgent systems | Asynchronous programming | C++ | Context Scheduling | Conversation ManagementMid-level Full Time深圳2h ago
-
Mid-level Full Time广州2h ago
-
Mid-level Full Time广州2h ago
-
驾舱一体专家/高级专家/总监 CNY 240K-480KComputer Vision | Data Generation | Data Preprocessing | Data cleaning | Deep learningSenior-level Full Time北京、上海、深圳、广州2h ago
-
Entry-level Full Time深圳、上海、北京2h ago
-
Mid-level Full Time深圳、上海、北京、中国香港2h ago
-
Data Visualization | Data alignment | Data labeling | Data pipeline | Dataset integrationMid-level Full Time深圳、上海2h ago
-
Software Engineer (All Levels) – 大模型与智能机器人系统 CNY 240K-480KC++ | CUDA | DDS | GPU memory | GPU memory managementEntry-level Full Time广州、深圳2h ago
-
Agent systems | LLM | Machine Learning | Multi-Agent | Multi-Agent SystemsMid-level Full Time深圳、上海2h ago
-
Entry-level Full Time InternshipBeijing2h ago
-
多模态大模型算法工程师(Vlm / 自动驾驶方向) CNY 180K-264KAgent modeling | Autonomous Driving | Autoregressive modeling | BEV | Behavior ModelingEntry-level Full Time北京、苏州3h ago
-
大模型算法专家 CNY 240K-480KAgentic RL | Language Models | Large Language Models | Linux | Multimodal GenerationSenior-level Full Time北京5h ago
-
Entry-level Full Time北京7h ago
-
Senior-level Full Time北京7h ago
-
Senior-level Full Time北京1d ago
-
Senior-level Full Time上海1d ago
-
【算法】多模态/大模型算法专家(上海) CNY 240K-480KAgent Framework | C plus plus | Computer Vision | Language Processing | LinuxSenior-level Full Time上海1d ago
-
【算法】计算机视觉算法专家(上海) CNY 240K-480KAnomaly Detection | C++ | Computer Vision | Deep learning | Fine-grained classificationSenior-level Full Time上海1d ago
-
大模型应用算法工程师/专家 CNY 240K-480KC++ | Computer Vision | Deep learning | Direct Preference Optimization | Human Computer DialogueSenior-level Full Time上海、北京1d ago
-
大模型应用算法实习生 CNY 25K-37KAgentic Systems | C++ | Customer Service | Customer Service Automation | Deep learningOne-on-one mentorship | Technical workshopsEntry-level Internship上海1d ago
-
【算法】计算机视觉算法专家(杭州) CNY 240K-480K3D Vision | Anomaly Detection | C++ | Computer Vision | Image RetrievalSenior-level Full Time杭州1d ago
-
机器学习特征/样本数据工程研发 CNY 180K-300KC++ | Caching | Data Engineering | Data Governance | Feature EngineeringEntry-level Full Time北京、上海1d ago