Find jobs in AI/ML, Data Science and Big Data
4 results for Chunked prefill (Skill/Tech stack)
-
Senior ML Engineer - Kimchi (LLM Inference Optimization)
GBP 110K-141K
Activations quantization | Amazon Web Services | ArgoCD | CUDA | CUDA-adjacent tooling
Equipment budget | Equity options | Extra days off | Hackathon | Learning budget
Senior-level, Full Time
United Kingdom R
8d ago
-
Senior ML Engineer - Kimchi (LLM Inference Optimization)
PLN 292K-400K
AWS | ArgoCD | Azure | CUDA | Chunked prefill
Annual hackathon | Conference access | Equipment budget | Equity options | Extra days off
Senior-level, Full Time
Poland R
8d ago
-
AI/ML ASIC Architect
USD 163K-249K
ARM | ASIC architecture | AXI interconnect | Area Optimization | Attention Mechanisms
Senior-level, Full Time
Milpitas, CA, United States
12d ago
-
AWQ | Audio codecs | Audio streaming | Autoscaling | Chunked prefill
401k matching | Annual offsites | Dental coverage | Employer-paid training | Healthcare benefits
Mid-level, Full Time
San Francisco, CA
25d ago