Find jobs in AI/ML, Data Science and Big Data
3 results
for Prefill decode disaggregation
(Skill/Tech stack)
-
AI Software Engineer USD 151K-332KC++ | CUDA | CUDA kernels | CUDA profiling | Cache ManagementCommunity involvement | Health benefits | Hybrid work options | In-person work options | Remote work optionsMid-level Full TimeSeattle (WA), United States5d ago
-
Senior Inference Engineer, AIConfigurator for Dynamo USD 184K-356KBatching | Distributed Systems | Expert parallelism | GPU Computing | High PerformanceEquity | Health benefits | Hybrid workSenior-level Full TimeUS, CA, Santa Clara, United States18d ago
-
AI Platform Engineer INR 1500K-2500KAutomated Evaluation | CI/CD | CUDA | Continuous Checkpointing | Continuous batchingMid-level Full TimeBangalore, India1mo ago