Find jobs in AI/ML, Data Science and Big Data
11 results
for Operator fusion
(Skill/Tech stack)
-
Attention | Batching | C++ | CUDA | CUDA kernelsBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States6d ago
-
Batching | Benchmarking | C# | C++ | CUDASenior-level Full TimeBala Cynwyd (Philadelphia Area), PA, United …12d ago
-
Machine Learning Engineer - AI Compiler Optimization USD 156K-387KAutomatic Operator Generation | Code generation | Compilation Optimization | Graph optimization | Hardware-Software CodesignMid-level Full TimeSan Jose, California, United States18d ago
-
AI Software Lead – PyTorch & CUDA Runtime (Next-Gen Accelerator) INR 2475K-3380KC# | C++ | CUDA | Compiler runtime interaction | Compiler/runtimeSenior-level Full TimeBengaluru, KA, India22d ago
-
AI/ML performance engineer INR 1500K-2100KAndroid | Bandwidth analysis | Batching | Benchmarking | DMAMid-level Full TimeBangalore, Karnataka, India29d ago
-
Compiler | Emulation | Firmware debugging | Machine Learning | Memory bandwidthMid-level Full TimeSan Jose, California, United States1mo ago
-
Senior-level Full TimeSan Jose, CA, United States1mo ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Distributed Systems | FP8 | FasterTransformer | Flash AttentionOn-site workEntry-level Full Time InternshipCHN - Minhang, China1mo ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batchingOn-site workEntry-level Full Time InternshipCHN - Minhang, China1mo ago
-
Senior Deep Learning Compiler Verification Engineer USD 140K-224KC++ | Formal verification | Graph optimization | IR lowering | JAXComprehensive benefits package | EquitySenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
ARM Mali | Android | Apple Neural Engine | C++ | CoreMLCommute subsidy | Employee assistance program | Employee resource groups | Employee stock ownership | Generous vacation and personal daysSenior-level Full TimeMountain View, CA, USA1mo ago