DevOps Engineer, GPUaaS
Singapore, Singapore
SGD 162K-203K (estimate) Senior-level Full Time
Tasks
- Collaborate to streamline workflows and improve collaboration
- Conduct GPU cluster benchmark and track GPU technology advancements
- Design deploy and support GPU clusters for AI and ML workloads
- Design implement and manage CI CD pipelines for AI models and GPU accelerated applications
- Identify bottlenecks and improve development and operational processes for AI and HPC GPU cloud
- Implement security best practices for multi tenant GPUaaS
- Improve infrastructure provisioning management and monitoring through automation
- Manage and automate provisioning of GPU resources on prem and cloud
- Monitor cluster usage health performance and availability
- Optimize system parameters for AI workload performance
- Participate in rotational or scheduled shift work
- Provide technical support and guidance to users
- Set up monitoring and logging for GPU resources
- Solve problems in high performance distributed computation
- Troubleshoot compute resource system level issues
Perks/Benefits
- Flexible work arrangements
- Health and wellness benefits
- Internal mobility opportunities
- Training and development programs
Skills/Tech-stack
Ansible | Automation | Bash | CI/CD | CUDA | CentOS | Containers | Docker | GPU Acceleration | GPU Architecture | GPU drivers | IaaS | Infiniband | Jenkins | Kubernetes | Linux | Logging | MPI | Monitoring | NCCL | NVIDIA DCGM | NVIDIA GPUs | Networking | PaaS | Prometheus | PyTorch | Python | RDMA | Rocky Linux | Security | Slurm | TensorFlow | Terraform | Ubuntu | Zabbix
Roles
Related jobs
-
Enterprise Sales Engineer SGD 162K-213KAWS | Azure | C# | CI/CD | CSPMBest in class onboarding | Career pathing | Continuous professional development | Employee stock purchase plan | Global benefitsSenior-level Full TimeSingapore, Singapore1d ago
-
AI Deployment Engineer SGD 120K-171KChatGPT | Cloud Architecture | Enterprise Architecture | Generative AI | JavaScriptHybrid work model | Relocation assistanceSenior-level Full TimeSingapore1d ago
-
Agile | BI | Big Data | CI/CD | ClouderaConsulting Lifestyle | Travel opportunitiesMid-level Full TimeSingapore, Singapore, SG1d ago
-
Agile | Big Data | Cloud Data | Cloud Data Engineering | ClouderaTravel opportunitiesSenior-level Full TimeSingapore, Singapore, SG1d ago
-
Senior/Staff Engineer, FE Labor Prod. & Data Engrg SGD 96K-132KArtificial Intelligence | Automation | Change Management | Copilot Studio | DashboardingSenior-level Full TimeFab 10A, Singapore1d ago
-
Senior/Staff Engineer, FE Labor Prod. & Data Engrg SGD 96K-132KAutomation | Copilot Studio | Data Visualization | Generative AI | GovernanceSenior-level Full TimeFab 10A, Singapore1d ago
-
Actuator integration | BACnet | Block programming | Building Management | Building Management SystemsSenior-level Full TimeNgee Ann Polytechnic, Clementi Campus, Singapore1d ago
-
Speech Algorithm Engineer SGD 36K-130KAlgorithms | Audio Captioning | Audio Understanding | Audio Video Multimodality | Audio/VideoEntry-level Full TimeSingapore-CapitaSky1d ago
-
AI Platform Engineer SGD 143K-143KAWS Lambda | Amazon Bedrock | Amazon ECS | Amazon RDS | Amazon S3Mid-level Full TimeSGP Keppel Bay Tower, Singapore1d ago
-
ARM Templates | Application Insights | Aqua Security | Azure Container | Azure Container RegistrySenior-level Full TimeSingapore, Singapore1d ago
-
Machine Learning LLM Application Intern (Global LIVE Operation Intelligence) - 2026 Start (BS/MS) SGD 53K-57KDeep learning | Image Generation | Language Models | Large Language Models | Machine LearningEntry-level InternshipSingapore, Singapore1d ago
-
Senior Vision Development Engineer SGD 96K-132KC Sharp | C plus plus | C# | Cognex | Computer VisionCompetitive remuneration | Comprehensive benefitsSenior-level Full TimeSingapore, Singapore1d ago
-
Research Scientist, Machine Learning (PhD) SGD 60K-60KAgent Orchestration | Algorithm Design | Artificial Intelligence | Bias Mitigation | Computer VisionDynamic work environment | Professional growth | Work authorization supportEntry-level Full TimeSingapore1d ago
-
DataOps Engineer SGD 88K-100KAPI Management | Azure DevOps | CI/CD | Cloud Computing | Database DesignMid-level Full TimeSingapore1d ago
-
Entry-level Full TimeSingapore, Singapore, Singapore1d ago
-
Data Engineer - A26106 SGD 70K-108KAWS Glue | Agile | Amazon Athena | Amazon RDS | Amazon S3Coaching and mentoring | Employee wellness program | Equal employment opportunities | Fun working environment | Growth opportunitiesMid-level Full TimeSingapore, Singapore, Singapore2d ago
-
Data Scientist Intern - Singapore SGD 70K-90KBenchmarking | Computer Vision | Data Analysis | Deep learning | Machine LearningEntry-level InternshipSingapore2d ago
-
AI Research Engineer SGD 60K-108KAWS Lambda | Amazon EC2 | Amazon ECS | Amazon RDS | Amazon SNSContinuous education | Flexible work policy | Growth mindset | Hybrid work | Remote workMid-level Full TimeSingapore2d ago
-
Graduate Algorithmic Trader SGD 70K-90KAlgorithmic trading | Blockchain | Cryptocurrency Trading | Data Analysis | DeFiCompany outings | Gaming events | Holiday celebrations | Hybrid work | Performance-based compensationEntry-level Full TimeSingapore R2d ago
-
Senior Software Engineer (AI Product Development) SGD 147K-180KAgile | Best practices | CI/CD | Cloud | DockerSenior-level Full TimeSingapore, Singapore2d ago
-
Robotics Engineer – New College Graduate (NCG) SGD 33K-48K3D Printing | Actuator integration | C++ | CAD | CalibrationCareer development | Learning opportunities | Supportive work cultureEntry-level Full TimeSingapore,SGP3d ago
-
Senior Robotics Applications Engineer SGD 96K-132KC++ | CI/CD | Calibration | Cobot | Collision AvoidanceSenior-level Full TimeSingapore,SGP3d ago
-
Technical Lead, Machine Learning SGD 140K-191KBias | DPO | Data Pipelines | Deployment | GPU OptimizationSenior-level Full TimeSingapore3d ago
-
Principal Machine Learning Engineer SGD 140K-195KApache Arrow | Apache Spark | DPO | Data Processing | Deep learningSenior-level Full TimeSingapore3d ago
-
Member of Technical Staff, Machine Learning SGD 140K-191KData Pipelines | GPU Computing | Inference | JAX | Machine LearningSenior-level Full TimeSingapore3d ago