Find jobs in AI/ML, Data Science and Big Data
14 results for Kernel Fusion (Skill/Tech stack)
-
Research Scientist - Distributed Machine Learning
USD 180K-287K
BF16 | CUDA | CUDA kernels | DeepSpeed | Distributed Training
401k | Dental insurance | Disability insurance | Employee assistance program | Health insurance
Mid-level Full Time
Sunnyvale, CA
2d ago
-
Machine Learning Infrastructure Engineer
USD 216K-330K
CUDA | DeepSpeed | Distributed Systems | Distributed Training | FSDP
Mid-level Full Time
Sunnyvale, CA
2d ago
-
AllGather | AllReduce | Artificial Intelligence | Asynchronous pipelines | Benchmarking
Senior-level Full Time
Seattle, United States R
6d ago
-
Deep learning | Distributed Training | Flash Attention | Inference Optimization | Kernel Fusion
Hybrid work
Senior-level Full Time
Toronto, Ontario, Canada
17d ago
-
Senior Systems Software Engineer, Performance Architecture - Analytics and Data Intelligence
USD 224K-356K
Apache Arrow | Benchmarking | C++ | CI/CD | CUDA
Equity
Senior-level Full Time
US, CA, Santa Clara, United States
18d ago
-
Batching | C# | C++ | CUDA | FP16
Dental insurance | Disability insurance | Flexible spending account | Flexible vacation | Health insurance
Mid-level Full Time
Anywhere, USA R
21d ago
-
Machine Learning Systems Engineer
USD 144K-192K
CUDA | GPU Kernels | Kernel Fusion | Machine Learning | Nsight
401k match | Dental insurance | Health savings account | Life insurance | Medical insurance
Senior-level Full Time
Remote U.S. R
1mo ago
-
Machine Learning Systems Engineer
USD 144K-192K
CUDA | Kernel Fusion | Nsight | PyTorch | PyTorch Profiler
401k match | Dental insurance | Health Accounts | Health savings account | Life insurance
Senior-level Full Time
Las Vegas, Nevada, United States R
1mo ago
-
Machine Learning Systems Engineer
USD 144K-192K
CUDA | Distributed Training | GPU Performance | GPU performance profiling | Kernel Fusion
401k match | Dental insurance | Health savings account | Life insurance | Medical insurance
Senior-level Full Time
Pittsburgh, Pennsylvania, United States R
1mo ago
-
Machine Learning Systems Engineer
USD 144K-192K
CUDA | Distributed Training | Kernel Fusion | Nsight | PyTorch
401k match | Dental insurance | Health savings account | Life insurance | Medical insurance
Senior-level Full Time
Boston, Massachusetts, United States R
1mo ago
-
Research Scientist (Visual Generative AI & World Models)
GBP 195K-270K
ATen | C++ | CUDA | Calculus | Deep learning
Dental plan | Employee assistance programme | Employee wellbeing support | Flexible working | Generous annual leave
Senior-level Full Time
London, UK
1mo ago
-
Engineering Manager, Model Inference
USD 220K-270K
APIs | Attention Mechanism | Batching | Distributed Systems | Docker
401k matching | Commuter benefits | Flexible PTO | Flexible spending accounts | Generous time off
Mid-level Full Time
SF Office
1mo ago
-
Auto-tuning | C++ | CUDA | Cache behavior | Computer Architecture
Comprehensive benefits package
Senior-level Full Time
Israel, Yokneam
1mo ago
-
Activation checkpointing | Attention Mechanisms | CUDA | Collective operations | Data parallelism
Senior-level Full Time
Mountain View, California; San Francisco, California
1mo ago