Find jobs in AI/ML, Data Science and Big Data
3 results for Low-bit quantization (Skill/Tech stack)
- Architecture Design | Autoregression | Diffusion Models | Generative Models | Knowledge Distillation
  Entry-level Internship
  San Jose, California, United States
  24d ago
- AI Software Engineer Intern, CNY 38K-50K
  CUDA | Distributed Systems | FP8 | FasterTransformer | Flash Attention
  On-site work
  Entry-level Full Time Internship
  CHN - Minhang, China
  1mo ago
- AI Software Engineer Intern, CNY 38K-50K
  CUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batching
  On-site work
  Entry-level Full Time Internship
  CHN - Minhang, China
  1mo ago