System Software Engineer, Platform Operations
Tasks
- Coordinate cross-functional operational best practices
- Develop operational plans
- Drive continuous improvement initiatives
- Implement SRE principles
- Lead incident response
- Manage live training event deployments
- Oversee platform stability and reliability
- Resolve emergent production issues
Perks/Benefits
- N/A
Skills/Tech-stack
AKS | AWS | AWS SNS | AWS SQS | Azure | Azure Service | Azure Service Bus | CI/CD | Docker | EKS | Event Driven | Event-driven architecture | GKE | Generative AI | GitLab CI | Google Cloud | Google Pub/Sub | Incident Response | Jenkins | Kubernetes | LLMs | Linux | Linux Shell | Linux Shell Scripting | Open edX | Pub/Sub | Python | Retrieval-Augmented Generation | SRE | Service Bus | Shell Scripting | Terraform | Vector Databases
Education
Roles
DevOps | DevOps Engineer | Engineer | Software Engineer | System Software Engineer