Find jobs in AI/ML, Data Science and Big Data
15 results
for GPU Kernel
(Skill/Tech stack)
-
Containerization | Debugging | DeepSpeed | Distributed Systems | GPU KernelMid-level Full TimeAbu Dhabi2d ago
-
Inference Optimization Intern – Performance Modeling USD 40K-142KC++ | CUDA | GPU Architecture | GPU Kernel | GPU Kernel DevelopmentEntry-level InternshipSunnyvale, CA5d ago
-
AllGather | AllReduce | Artificial Intelligence | Asynchronous pipelines | BenchmarkingSenior-level Full TimeSeattle, United States R6d ago
-
Application Software Engineer, Inference USD 135K-185KAgent Orchestration | Agent SDK | Auto Scaling | Batch scheduling | C++401k plan | Employee stock purchase plan | Long-term incentives | Medical, dental & vision coverage | Onsite Palo AltoEntry-level Full TimePalo Alto, CA11d ago
-
Inference Optimization Manager USD 229K-286KCloud infrastructure | Distributed Systems | GPU Kernel | GPU kernel programming | Inference engine401k matching | Flexible paid time off | Health insurance | Remote work options | Team onsite eventsMid-level Full TimeUnited States / Canada12d ago
-
CUDA | CUDNN | Cutlass | Deep learning | GPU ArchitectureMid-level Full TimeUS-WA-Bellevue13d ago
-
LLM Inference Frameworks and Optimization Engineer USD 160K-230KC++ | CUDA | CUDA graph | Cluster scheduling | CompilerEquity | Health insuranceMid-level Full TimeSan Francisco, Singapore, Amsterdam14d ago
-
Research Engineer (LLM Training and Performance) GBP 80K-120KAOTAutograd | CUDA | CuTe | Cutlass | Data loadersSenior-level Full TimeAmsterdam, Netherlands; Belgrade, Serbia; Berlin, Germany; …15d ago
-
Senior Scientific Machine Learning Engineer – Earth-2 USD 152K-287KCUDA | Containers | Data parallelism | Diffusion Models | GPU KernelBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara R16d ago
-
A/B | A/B Experimentation | Autoscaling | Caching | Canary testingCommute subsidy | Disability insurance | Employee stock ownership | Generous vacation | Health insuranceSenior-level Full TimeShanghai, China21d ago
-
Airflow | CUDA | Data Lake | Data Warehouse | FlinkCommute subsidy | Competitive retirement pension plans | Employee resource groups | Employee stock ownership | Generous vacation personal daysSenior-level Full TimeShanghai, China21d ago
-
A/B | A/B Testing | Autoscaling | B testing | Canary testingCommute subsidy | Competitive retirement pension plans | Employee resource groups | Employee stock ownership | Generous vacationSenior-level Full TimeShanghai, China21d ago
-
Staff Compiler Engineer - PyTorch + Kernel DSLPLATE USD 163K-253KAutotuning | Collective Primitives | Cost Based Compilation | Custom ISA | Cutlass401k | Adoption support stipend | Charitable giving match | Fertility care stipend | Flexible work environmentSenior-level Full TimeSan Jose, California, United States27d ago
-
Entry-level Full TimeNew York, NY, United States29d ago
-
Auto-tuning | C++ | CUDA | Cache behavior | Computer ArchitectureComprehensive benefits packageSenior-level Full TimeIsrael, Yokneam1mo ago