大模型训练与推理Infra工程师-MiMo
Tasks
- Analyze and optimize data loading, communication, and hardware utilization
- Build distributed training compute platform
- Build performance monitoring toolchain
- Collaborate with research teams to tailor training and inference strategy
- Develop inference framework for online and offline serving
- Implement inference optimization techniques
- Integrate high-performance computing technologies
- Maintain distributed training framework
- Optimize inference latency memory and energy usage
- Support model deployment for product teams
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | CUDA | CUDNN | Deep learning | DeepSpeed | Distributed Training | Docker | GPU | Horovod | JAX | Kubernetes | MPI | Mixed Precision | Mixed-precision training | NCCL | NPU | Pruning | PyTorch | Python | Quantization | Ray | TensorFlow | Terraform
Education
Related jobs
-
大模型算法研究员-MiMo CNY 500K-500KActive Learning | C++ | Curriculum learning | Data Augmentation | Deep learningMid-level Full Time北京9h ago
-
多模态大模型算法工程师(偏Llm) CNY 240K-480KComputer Vision | Deep learning | Fine Tuning | Language Models | Large Language ModelsMid-level Full Time北京9h ago
-
Senior-level Full Time北京9h ago
-
Senior-level Full Time北京10h ago
-
软件工程师 - 模型训练基础建设 CNY 180K-360KCI/CD | Containerization | Data Preprocessing | Deep learning | DeepSpeedEntry-level Full Time广州10h ago
-
Agent Frameworks | Anonymization | Boundary testing | Case design | Data PrivacyEntry-level Full Time InternshipBeijing, Beijing, China21h ago
-
大模型算法工程师--c端方向 CNY 240K-480KChain-of-Thought | Deep search | Information Retrieval | LLM Inference | LLM TrainingMid-level Full Time北京1d ago
-
AI Intern – RAG Engineering CNY 37K-44KContext Construction | Dify | Document parsing | Information Retrieval | LangchainEntry-level Full Time Internship北京市, 北京市, 中国1d ago
-
AI Intern – Agent & LLM Solutions CNY 37K-48KDify | Langchain | Langgraph | Language Models | Large Language ModelsEntry-level Full Time Internship北京市, 北京市, 中国1d ago
-
ANSYS | APDL | AVL Excite | C# | Durability analysisSenior-level Full TimeWuhan, Hubei, China2d ago
-
A/B | A/B Testing | AWS | B testing | Cohort AnalysisComprehensive benefits package | Flexible work model | Work from home flexibilityMid-level Full TimeShanghai, China R2d ago
-
ASIC Design Flow | ASIC design | C plus plus | Design flow | Low powerMid-level Full TimeChina, Shanghai2d ago
-
大模型算法实习生 CNY 36K-37KDeep learning | DeepSpeed | Distributed Training | GPU Training | JavaLarge scale text data access | Stable internship opportunity | Supportive team environment | Technical mentorshipEntry-level Internship北京、上海3d ago
-
大模型算法-校招 CNY 500K-500KDeep learning | DeepSpeed | Distributed Training | GPU Training | Information ExtractionLarge-scale datasets | NLP application projects | Relaxed team atmosphere | Technical mentorshipEntry-level Full Time上海、北京3d ago
-
Entry-level Full Time北京3d ago
-
高级Ai系统开发工程师(大模型与Rag方向) CNY 240K-480KAgent workflow | Caching | Distributed Systems | Dynamic batching | ElasticsearchSenior-level Full Time武汉3d ago
-
Senior-level Full Time北京3d ago
-
Senior-level Full Time北京3d ago
-
Senior-level Full TimeChina4d ago
-
Research Intern (AI Agent) CNY 25K-37KAgent systems | Embodied AI | Language Models | Large Language Models | Memory-augmented systemsEntry-level Full Time Internship深圳4d ago
-
具身智能算法实习生 (Manipulation) CNY 25K-37KCLIP | Computer Vision | Deep learning | Diffusion Model | Fine TuningEntry-level Internship深圳4d ago
-
Agi 后端工程师-下一代Ai数据链路 CNY 180K-360KCache | Data Processing | Data Storage | Data cleaning | Data pipelineEntry-level Full Time北京、上海4d ago
-
校招-Ai研究科学家-大语言模型/视觉语言模型算法与后训练(博士优先) CNY 500K-500KAdapters | Direct Preference Optimization | Fine Tuning | Flax | Function designNone Full Time上海4d ago
-
Activation Function | Architecture Design | Automated testing | CI/CD | Computer VisionBirthday off | Flexible working hours | Local holidays | Onsite work | Paid vacationMid-level Full TimeShenzhen4d ago
-
Entry-level Internship Part TimeChina4d ago