Find jobs in AI/ML, Data Science and Big Data
4 results for Paged Attention (Skill/Tech stack)
- AI Software Engineer Intern (CNY 38K-50K)
  CUDA | Distributed Systems | FP8 | FasterTransformer | Flash Attention
  On-site work | Entry-level Full Time Internship | CHN - Minhang, China | 14d ago
- AI Software Engineer Intern (CNY 38K-50K)
  CUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batching
  On-site work | Entry-level Full Time Internship | CHN - Minhang, China | 14d ago
- Member of technical staff (Inference) - Paris (EUR 80K-120K)
  C++ | CUDA | Caching | Continuous batching | Distributed Computing
  Career development | Continuous learning | Hybrid work | Professional growth
  Senior-level Full Time | Paris | 28d ago
- Member of technical staff (Inference) - London (GBP 230K-325K)
  C++ | CUDA | CUDA kernel | CUDA kernel programming | Caching
  Continuous learning | Hybrid work | Professional development
  Senior-level Full Time | London | 1mo ago