Site Reliability Engineer - ARK Large Model Platform (Singapore)
Singapore, Singapore
Mid-level Full Time Found 14d ago
Tasks
- Develop observability systems
- Ensure high reliability and performance
- Handle large-scale cluster management
- Improve IT cost efficiency
- Manage system stability
- Research large model solutions
Perks/Benefits
- N/A
Skills/Tech-stack
Cloud Native | Cluster management | DevOps | High Performance | High-Performance Computing | Model Inference | Monitoring | Multi-framework | Performance Computing | Resource scheduling | Web IDE
Language: en |
Views: 1 |
Clicks: 0
Related jobs
-
Mid-level Full TimeSingapore1d ago
-
Mid-level Full TimeSingapore, Singapore, Singapore2d ago
-
AES | Audit Log | Audit Log Analysis | Azure DevOps | Azure MonitorSenior-level Full TimeNgee Ann Polytechnic, Clementi Campus, Singapore3d ago
-
Data Analysis | Data Systems | Data Systems Design | Distributed Systems | High PerformanceMid-level Full TimeSingapore, Singapore3d ago
-
Senior-level Full TimeSingapore, Singapore, Singapore4d ago
-
Senior AI Engineer SGD 140K-185KAPI Development | Agent Framework | Agent systems | Agent workflows | Asynchronous programmingSenior-level Contract Full TimeSingapore, Singapore, Singapore6d ago
-
Data Engineer SGD 107K-143KApache NiFi | Automation | Batch Processing | Cloudera | Data GovernanceSenior-level Full TimeSingapore, Singapore, Singapore6d ago
-
Senior Software Engineer MDM SGD 115K-185KAPI Development | BigQuery | Data Governance | Data Modeling | Data QualitySenior-level Full TimeSingapore Office SGO10d ago
-
DevOps Engineer, GPUaaS SGD 140K-200KAI frameworks | Ansible | Automation | Bash | CI/CDFlexible work | Health benefits | Internal mobility | Training and developmentSenior-level Full TimeSingapore, Singapore10d ago
-
Platform Engineer SGD 147K-203KAWS | Airflow | Athena | Cloud Native | Cluster planning401k matching | AI-driven tech | Dental and vision | Diverse workplace | Global volunteeringSenior-level Full TimeSingapore, Singapore, Singapore11d ago
-
AI model co-design | CV | Co-design | Cost Optimization | Cross domainSenior-level Full TimeSingapore, Singapore14d ago
-
AI | AIOps | Capacity Planning | Cause analysis | Cloud servicesSenior-level Full TimeSingapore, Singapore14d ago
-
Automation | Data Ingestion | Data Pipelines | Data Transformation | Fault ToleranceEntry-level Full TimeSingapore, Singapore14d ago
-
Cloud Native | Computer Science | Engineering | Inference services | Mathematical AnalysisMid-level Full TimeSingapore, Singapore14d ago
-
Cloud Native | Cluster management | Cluster solutions | Kubernetes | Multi-ClusterMid-level Full TimeSingapore, Singapore14d ago
-
Cloud Computing | Cluster governance | Coding | Disaster Recovery | Distributed SystemsMid-level Full TimeSingapore, Singapore14d ago
-
Cloud Native | Cluster management | Elasticity | Kubernetes | Multi-cloudMid-level Full TimeSingapore, Singapore14d ago