Senior ML Infrastructure Engineer
Tasks
- Architect next-generation multi-cluster orchestration
- Build developer productivity tooling
- Drive GPU utilization and training throughput
- Own and evolve multi-cluster GPU infrastructure
- Own compute budget and cost efficiency
Perks/Benefits
- N/A
Skills/Tech-stack
CI/CD | CUDA | Cluster management | Distributed Systems | Distributed Training | Docker | Experiment tracking | GCP | GPU Profiling | GitHub Actions | Machine Learning | Machine Learning Pipelines | Model Registry | PyTorch | Slurm | Triton | WandB
Education
N/A