Find jobs in AI/ML, Data Science and Big Data
15 results
for Cutlass
(Skill/Tech stack)
-
Inference Engineer - Acceleration CHF 110K-160KAdmission control | CUDA | Cutlass | FlashAttention | KV cacheCommuting subsidy | Learning and development budget | Offsites and team events | Pension plan | Vacation daysMid-level Full TimeZürich, Switzerland2d ago
-
AI Performance Optimization Engineer USD 136K-258KC++ | Cache optimization | Continuous batching | Cutlass | Deep learningMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 159K-264KC++ | Continuous batching | Cutlass | Deep learning | DeepSpeedRemote workMid-level Full TimeUnited States - Remote R4d ago
-
Research Engineer, ML Systems (All Industry Levels) USD 225K-400KCUDA | CUDA kernels | Cloud | Cutlass | DeepSpeedMid-level Full TimeRedwood City, CA5d ago
-
Staff Software Engineer, GPU Performance USD 207K-300KAMD | CUDA | Code generation | Compiler optimization | CutlassSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA7d ago
-
Research Engineer, Training & Inference USD 200K-450KC++ | CUDA | Cutlass | Distributed Training | FSDP401k matching | Employer-paid health insurance | Health Savings Account (HSA) | Unlimited PTOEntry-level Full TimePalo Alto13d ago
-
Senior Deep Learning Software Engineer, Inference USD 184K-356KAgile | C# | C++ | CUDA | CutlassEmployee benefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States16d ago
-
Senior Software Engineer, AI Inference Systems PLN 292K-507KAlgorithms | C++ | CI/CD | CUDA | CUDA GraphsHybrid workSenior-level Full TimeGermany, Remote R20d ago
-
Benchmarking | CUDA | CUDNN | Cutlass | Deep learningMid-level Full TimeUS-WA-Bellevue20d ago
-
Staff Technical Lead for Inference & ML Performance USD 180K-300KCUDA | Compilation | Cutlass | Distributed Serving | Kernel optimizationSenior-level Full TimeSan Francisco23d ago
-
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles USD 184K-356KC++ | CUDA | Cutlass | Efficient Attention | GPU ArchitectureSenior-level Full TimeUS, CA, Santa Clara, United States30d ago
-
Senior Solutions Architect, Generative AI USD 184K-287KC++ | CUDA | CUDNN | Containers | CutlassOccasional travel | Remote workSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Senior-level Full TimeChina, Shanghai1mo ago
-
Senior Software Engineer, LLM Performance USD 180K-339KC++ | CUDA | Cutlass | FlashAttention | FlashInferSenior-level Full TimeSF Bay Area (Hybrid) R1mo ago
-
C++ | CUDA | CUDNN | Cutlass | Distributed inferenceSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago