Find jobs in AI/ML, Data Science and Big Data
3 results
for GPU Cluster Management
(Skill/Tech stack)
-
Software Engineer - Reliability USD 200K-250KAWS | Bash | Cluster management | Containerization | DCGMSenior-level Full TimeSF Bay Area, CA, Remote, US R1d ago
-
Senior Software Engineer - ML Ops CAD 140K-171KArgoCD | Autoscaling | CI/CD | Cluster Autoscaler | Cluster managementFour day week Summer 2026 | In person collaboration 2 days per week London ONSenior-level Full TimeLondon, Ontario, Canada8d ago
-
Senior MLOps & AI Infrastructure Engineer USD 149K-215KAWS SageMaker | Airflow | Arize | Azure Machine Learning | BashSenior-level Full TimeSan Jose, California, United States, United …20d ago