Senior Manager, AI Infrastructure
Shanghai Jing'An Office, China
CNY 435K-500K (estimate) | Senior-level | Full Time
Tasks
- Build and mentor a full-stack infrastructure team
- Deploy Kubernetes-based model serving
- Design AI factory orchestration for training and inference
- Ensure infrastructure compliance with data residency and security regulations
- Host and manage infrastructure for AI agents
- Implement FinOps for AI inference economics
- Implement LLMOps pipelines for prompt versioning and vector database scaling
- Implement carbon-aware scheduling for green AI
- Lead distributed training optimizations with DeepSpeed and Megatron
- Manage GPU clusters and high-performance interconnect networking
- Manage vendor strategy and a resilient hardware and cloud supply chain
- Translate research requirements into infrastructure and finance needs
- Tune models with hardware-aware performance engineering
- Use observability tools for real-time hardware telemetry
Perks/Benefits
- N/A
Skills/Tech-stack
AI Agents | Carbon-Aware Scheduling | Cluster Management | DeepSpeed | Distributed Training | FinOps | GPU Cluster Management | Grafana | High-Performance Computing | InfiniBand | Kubeflow | Kubernetes | LLMOps | Liquid Cooling | Megatron | Model Serving | Observability | Prometheus | PyTorch | Ray | RoCE v2 | Vector Databases
Education
N/A