aijobs.net

Sign in

Site Reliability Engineer, Machine Learning Systems - Singapore

Singapore, Singapore

Mid-level Full Time

Apply Save

Found 1mo ago

Tasks

Develop monitoring and management tools for ML infrastructure and services
Ensure ML systems operate efficiently for deployment, training, evaluation, inference
Implement disaster recovery plans, cluster governance, and improve operational stability and efficiency
Maintain stability of offline tasks/services across multi-data center, multi-region, multi-cloud
Manage resources including computing and storage, plan capacity, control costs
Provide on-call support for system and business issues

Perks/Benefits

N/A

Skills/Tech-stack

Cloud Computing | Cluster governance | Coding | Disaster Recovery | Distributed Systems | Global Collaboration | Monitoring | Performance Analysis | Resource Management | Scalability | Server management | Storage Systems | System Stability

Education

Bachelor's | Master's

Roles

Site Reliability Engineer | Site Reliability Engineer - Machine Learning Systems

Regions

Countries

Cities

Apply Save

Language: en | Views: 1 | Clicks: 0 | Saves: 0

Related jobs

DevOps Engineer – Azure & Kubernetes A SGD 162K-203K

ACR | Azure DevOps | Azure Key Vault | Azure Kubernetes | Azure Kubernetes Service

Collaborative environment | DevOps and MLOps culture building | Impactful projects

Senior-level Full Time

Singapore, Singapore

9d ago
Site Reliability Engineer, Applied Machine Learning Engine (Singapore) SGD 70K-70K

Hardware Integration | Large System Operation | Machine Learning | Performance Analysis | Software development

Entry-level Full Time

Singapore, Singapore

15d ago
Site Reliability Engineer - Applied Machine Learning Engine (Singapore) SGD 65K-65K

Automation | Coding | Distributed Systems | Hardware Integration | Mathematical Analysis

Entry-level Full Time

Singapore, Singapore

1mo ago
Site Reliability Engineer - ARK Large Model Platform (Singapore)

Cloud Native | Cluster management | DevOps | High Performance | High-Performance Computing

Mid-level Full Time

Singapore, Singapore

1mo ago