Find jobs in AI/ML, Data Science and Big Data
10 results
for Paged Attention
(Skill/Tech stack)
-
Senior-level Full Time上海、北京7h ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Cache optimization | Compiler optimization | Continuous batchingMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 100K-150KC++ | CUDA | Continuous batching | Deep learning | Distributed TrainingMid-level Full TimeUnited States - Remote R5d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | CUDA | Communication Primitives | Continuous batchingMid-level Full TimeUnited States - Remote R5d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Continuous batching | Cutlass | DeepSpeedRemote workMid-level Full TimeUnited States - Remote R5d ago
-
AI Performance Optimization Engineer USD 100K-150KC++ | CUDA | Continuous batching | DeepSpeed | Distributed TrainingBenefits | Career growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R5d ago
-
AI Performance Optimization Engineer USD 100K-150KAttention Mechanisms | Benchmarking | C++ | Compiler optimization | Continuous batchingBenefits | Career growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R7d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | CUDA | Compiler optimization | Continuous batchingMid-level Full TimeUnited States - Remote R7d ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Distributed Systems | FP8 | FasterTransformer | Flash AttentionOn-site workEntry-level Full Time InternshipCHN - Minhang, China1mo ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batchingOn-site workEntry-level Full Time InternshipCHN - Minhang, China1mo ago