AI Infrastructure R&D Engineer (Sandbox / Containerization) - MiMo
Tasks
- Analyze bottlenecks and improve scheduling efficiency and stability
- Build containerized task execution platform with scheduling
- Build internal platform tools for task management and automation
- Create isolation and resource management mechanisms
- Design sandbox execution environment for RL training
- Implement security isolation with permission control and resource limits
- Implement task dispatch, resource scheduling, and environment reuse
- Integrate monitoring, logging, and observability for training tasks
Perks/Benefits
- N/A
Skills/Tech-stack
AppArmor | Argo Workflows | CI/CD | CPU resource scheduling | cgroups | containerd | Distributed Systems | Docker | ELK | Fault Tolerance | Firecracker | GPU resource scheduling | gVisor | Go | Grafana | JavaScript | Kata Containers | KubeRay | Kubernetes | Linux | Linux namespaces | Logging | Loki | Monitoring | Networking | Observability | OpenTelemetry | Performance optimization | Permissions | Prometheus | Python | Queues | Ray | Resource Isolation | Resource scheduling | Rust | SELinux | seccomp | Service Discovery | Shell | Slurm | Task Scheduling | TypeScript | Volcano
Education
N/A