Senior Data Platform Infrastructure Engineer, Off
Tasks
- Build and maintain CI/CD pipelines with infrastructure as code
- Build and operate observability services and audit logging
- Create infrastructure runbooks and disaster recovery plans
- Define and track SLO/SLA operational coverage and health checks
- Design and maintain secure platform networking foundations
- Implement and operate security and identity management capabilities
- Operate and harden catalog and storage layers
- Optimize cluster workload performance and cost efficiency
- Own platform health reporting and operational readiness
- Provision and operate platform compute and storage lifecycle management
- Support production troubleshooting and incident response
Perks/Benefits
- Employee networks
- Flexible work/life support
- Inclusive development opportunities
- Paid volunteer days
Skills/Tech-stack
AWS | AWS CloudFormation | Access Management | Alerting | Audit Logging | Backup/Restore | CI/CD | Chargeback | CloudWatch | Containers | DNS | Databricks | Datadog | Disaster Recovery | FinOps | Firewalling | GCP | Grafana | IAM | Iceberg | Identity and Access Management | Infrastructure as Code | Kubernetes | Microsoft Azure | Monitoring | Networking | Polaris | Private Connectivity | Prometheus | RBAC | SLA | SLO | Secrets management | Showback | Snowflake | Spark | Splunk | Terraform | Trino | Unity Catalog | VNet | VPC
Education
N/A