Senior Software Engineer, AI Inference Systems
Tasks
- Architect scheduling and orchestration for inference deployments
- Benchmark GPU kernels and inference
- Build kernel DSLs and compiler infrastructure
- Conduct and publish ML systems research
- Contribute features to vLLM
- Define MLPerf inference benchmarking tools
- Deploy containerized inference on GPU clusters
- Develop and optimize GPU kernels
- Profile and optimize inference framework
Perks/Benefits
Skills/Tech-stack
Algorithms | C++ | CI/CD | CUDA | CUDA Graphs | Cgroups | Computer Architecture | Cutlass | Data Structures | Deep learning | Distributed Systems | Docker | Infrastructure as Code | Kubernetes | LLVM | Linux | Linux Namespaces | MLIR | NCCL | Nsight Compute | Nsight Systems | Operating Systems | Parallel Programming | Production observability | Profiling | PyTorch | PyTorch Inductor | Python | SGLang | Slurm | Tensor cores | TorchDynamo | Triton | VLLM | XLA | “as-code”
Education
Related jobs
-
Data Analysis | Deep learning | Machine Learning | NumPy | PandasCompany bike | Corporate discounts | Health awareness | Hybrid work | Inclusive work environmentEntry-level Full TimeMonheim, North Rhine Westfalia, DE R13h ago
-
Data Analysis | Deep learning | Machine Learning | Model Development | NumPyChildcare support | Coaching and mentoring | Flexible work arrangements | Health checks | Health seminarsEntry-level Full TimeMonheim, Nordrhein-Westfalen, DE R13h ago
-
API Integration | AWS Lambda | AWS Secrets | AWS Secrets Manager | AWS Systems ManagerCollaborative ownership driven engineering culture | Continuous learning and professional development | Flexible working hours | Fully remote across Europe | Work-life balanceMid-level Full TimeGermany R18h ago
-
API Integration | Banking integrations | CRM | CrewAI | DATEVDeutschland-Ticket subsidy | Employer pension contribution | Fitness and wellness benefits | Free snacks and beverages | Hardware allowanceMid-level Part TimeOffice Stuttgart, Office Berlin, Homeoffice Berlin, … R1d ago
-
API Authentication | API Keys | Apache Airflow | CI/CD | DBTFlexible collaboration hours | Fully remote work | High ownership | Work-life balanceSenior-level Full TimeGermany R1d ago
-
AI infrastructure | Benchmarking | Bottleneck analysis | Cloud Computing | Deep learningContinuous learning | Flexible location | Fully remote | Professional development | Technical autonomyMid-level Full TimeGermany R1d ago
-
Finance Data Engineer EUR 56K-84KAirflow | Autogen | Azure Data | Azure Data Factory | BAPIsCompany events | Enhanced parental leave | Family emergency leave | Gym membership | Learning allowanceSenior-level Full TimeMunich R1d ago
-
AI Engineer (m/w/d) EUR 48K-63KAI architecture | C# | Cloud Computing | Containers | Data Pipelines30 days paid time off | Discounted lunch | Employer pension contributions | Fitness center membership | Free beveragesMid-level Full TimeBerlin, Remote R1d ago
-
Junior Data Engineer EUR 30K-32KAirbyte | Airflow | Amazon Web Services | BigQuery | CI/CDDaycare Benefits | Deutschland ticket | Gym and fitness membership | Health and wellness benefits | Healthy MealsEntry-level Full TimeBerlin R1d ago
-
(Senior) AI Engineer (all genders) EUR 65K-75KCloud Platforms | Containerization | DevOps | Docker | LLM Inference30 days vacation | Bio Fruit Vegetable | Businessbike | Company fitness | E-learning reimbursementMid-level Full TimeBremen, Munich, Mannheim, Mainz, Berlin, Remote R1d ago
-
Chatbot LLM Engineer (all genders) EUR 45K-66KAPI Integration | Agile | LLM | Language Models | Large Language Models30 days vacation | Business bike | Company fitness | E-learning reimbursement | Family-friendly supportMid-level Full TimeBremen, Munich, Mannheim, Mainz, Berlin, Remote R1d ago
-
Amazon Redshift | Data Modeling | Data Pipelines | Data Warehousing | Database DesignSenior-level Full TimeGermany R1d ago
-
Cloud infrastructure | Data Pipelines | Debugging | ETL | Google ColabCareer growth opportunities | Continuous learning opportunities | Coworking access | Flexible work hours | Fully remoteMid-level Full TimeGermany R1d ago
-
C# | MATLAB | NumPy | Pandas | PythonPart-timeSenior-level Full TimeGermany - Remote R1d ago
-
Cloud Computing | Distributed Systems | Go | Infrastructure as Code | Kubernetes100 percent remote | Annual leave policy | Global culture | Mentorship | On-call rotationSenior-level Full TimeGermany (Remote) R2d ago
-
Internship Measurement Systems and Machine Learning for Inertial Sensor Characterization EUR 31K-31KCircuit design | Control Systems | Data Pipelines | Data Preprocessing | ElectronicsHybrid work setup | University enrollment requirementEntry-level Full Time InternshipKusterdingen, BW, Germany R2d ago
-
Amazon Redshift | Code review | DBT | Dashboards | Data CrawlingCareer development | Charitable contributions | Fitness reimbursement | Health insurance | Inclusive collaborative cultureMid-level Full TimeGermany R2d ago
-
AI/ML Engineer (w/m/d) EUR 50K-68KAWS | Artificial Intelligence | Cloud Computing | Data Engineering | Data platformAnnual learning budget | Company and team events | Flexible working hours | Internet stipend | Meal subsidyMid-level Full TimeBerlin, Germany, Remote, Germany R2d ago
-
AI Research Engineer - Computer Vision GBP 65K-90KAdversarial Attacks | Computer Vision | Deep learning | Edge Computing | Hardware optimizationEnhanced parental leave | Gym membership | Learning allowance | Mental health support | Paid Family Emergency LeaveSenior-level Full TimeBerlin; London; Munich R2d ago
-
C# | MATLAB | NumPy | Pandas | PythonFlexible hours | Freelance project-based work | Part-time remote workSenior-level FreelanceGermany - Remote R2d ago
-
Bilby | CAMB | Class | DOLFINx | FenicsPart time freelance projects | Project based workSenior-level FreelanceGermany - Remote R2d ago
-
Computational physics | Electromagnetism | Mechanics | NumPy | Numerical SimulationPart-time freelanceEntry-level FreelanceGermany - Remote R2d ago
-
C# | MATLAB | NumPy | Pandas | PythonFreelance projectsEntry-level FreelanceGermany - Remote R2d ago
-
Bash | C# | C++ | CI/CD | Cloud30 days vacation | Flexible working hours | Health insurance package | Remote work within European Union | Sports club membershipMid-level Full TimeMunich, Germany R3d ago
-
Machine Learning Engineer EUR 32K-37KDocker | Kubernetes | MLOps | MLflow | Machine LearningRemote workMid-level Full TimeBerlin, Germany; Helsinki, Finland R3d ago