AI Infrastructure R&D Engineer (Sandbox / Containerization) - MiMo
Tasks
- Analyze system bottlenecks and optimize throughput and stability
- Build containerized task execution platform with Docker and Kubernetes
- Create security isolation mechanisms for code execution
- Design sandbox execution environment for RL training
- Develop task management and runtime monitoring tools
- Implement container scheduling and resource isolation
- Scale RL training infrastructure with task distribution and recovery
Perks/Benefits
- N/A
Skills/Tech-stack
AppArmor | Argo Workflows | CPU resource scheduling | cgroups | containerd | Distributed Systems | Docker | ELK | Fault Recovery | Firecracker | GPU resource scheduling | gVisor | Go | Grafana | JavaScript | Kata Containers | KubeRay | Kubernetes | Linux | Linux Namespaces | Logging | Loki | Monitoring and Alerting | Observability | OpenTelemetry | Prometheus | Python | Ray | Resource scheduling | Rust | SELinux | seccomp | Service Discovery | Shell | Slurm | Task Scheduling | TypeScript | Volcano
Education
N/A
Roles
AI | AI Infrastructure Engineer | Backend | Backend Engineer | DevOps | DevOps Engineer | Engineer | Infrastructure Engineer