Find jobs in AI/ML, Data Science and Big Data
4 results for PagedAttention (Skill/Tech stack)
- Senior-level · Full Time · Dublin, Ireland · 1d ago
- AI Inference Engineer - Model Optimization & Deployment · USD 205K-303K · Accuracy evaluation | BF16 | C++ | CUDA | CUDA kernels · Senior-level · Full Time · Foster City, CA · 6d ago
- Entry-level · Internship · Beijing · 21d ago
- AWQ | C++ | CUDA | CUDA kernels | Continuous batching · Senior-level · Full Time · Donostia, Spain · 22d ago