Find jobs in AI/ML, Data Science and Big Data
4 results for Prefix caching (Skill/Tech stack)
-
Senior ML Engineer - Kimchi (LLM Inference Optimization)
GBP 110K-141K
Activations quantization | Amazon Web Services | ArgoCD | CUDA | CUDA-adjacent tooling
Equipment budget | Equity options | Extra days off | Hackathon | Learning budget
Senior-level | Full Time
United Kingdom R | 8d ago
-
Senior ML Engineer - Kimchi (LLM Inference Optimization)
PLN 292K-400K
AWS | ArgoCD | Azure | CUDA | Chunked prefill
Annual hackathon | Conference access | Equipment budget | Equity options | Extra days off
Senior-level | Full Time
Poland R | 8d ago
-
AI Engineer (m/w/d)
EUR 47K-47K
ArgoCD | Automated testing | Clean Code | Code review | DPO
Company pension | Corporate benefits | Professional development
Senior-level | Full Time
Berlin, Berlin, DE | 27d ago
-
Senior Machine Learning Engineer, Runtime and Serving
USD 213K-263K
Benchmarking | Buffer management | C++ | CUDA | Concurrent Systems
Senior-level | Full Time
Mountain View, CA, USA | 1mo ago