Find jobs in AI/ML, Data Science and Big Data
28 results
for XLA
(Skill/Tech stack)
-
AI Software Lead – PyTorch & CUDA Runtime (Next-Gen Accelerator) INR 2475K-3380KC# | C++ | CUDA | Compiler runtime interaction | Compiler/runtimeSenior-level Full TimeBengaluru, KA, India2d ago
-
AI SW Stack Deployment Architect INR 2500K-4500KAPI Design | Cloud Computing | Distributed Systems | Edge Computing | Inference ServerSenior-level Full TimeBengaluru, KA, India2d ago
-
AI Performance Optimization Engineer USD 136K-258KC++ | Continuous batching | Deep learning | Distributed Systems | FSDPMid-level Full TimeUnited States - Remote R3d ago
-
AI Performance Optimization Engineer USD 136K-258KC++ | Cache optimization | Continuous batching | Cutlass | Deep learningMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 136K-258KAccess patterns | Benchmarking | C++ | Cache optimization | Compiler optimizationFull-time W2 employment | Health benefits | Remote workMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 159K-264KC++ | Continuous batching | Cutlass | Deep learning | DeepSpeedRemote workMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 136K-258KBenchmarking | C++ | Compiler optimization | Continuous batching | DebuggingMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 136K-258KAccess Optimization | Attention Optimization | Benchmarking | C++ | Compiler optimizationMid-level Full TimeUnited States - Remote R5d ago
-
Senior Software Engineer, CUDA Deep Learning Systems USD 184K-356KC++ | CUDA | CUDA kernel | CUDA kernel optimization | Computer ArchitectureEquity | Health benefits | Paid time offSenior-level Full TimeUS, CA, Santa Clara, United States7d ago
-
Machine Learning Intern/Co-op (Fall, 2026) CAD 60K-60KCUDA | Distributed Training | GPU | JAX | MLIRCo-working stipend | Health and dental benefits | Inclusive culture | Lunch stipend | Parental leave top-upEntry-level InternshipCanada7d ago
-
Staff Software Engineer, GPU Performance USD 207K-300KAMD | CUDA | Code generation | Compiler optimization | CutlassSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA7d ago
-
Research Engineer, Pretraining, DeepMind USD 174K-252KFine Tuning | Inference Optimization | JAX | Language Models | Large Language ModelsMid-level Full TimeNew York, NY, USA13d ago
-
Senior Software Engineer, AI Inference Systems PLN 292K-507KAlgorithms | C++ | CI/CD | CUDA | CUDA GraphsHybrid workSenior-level Full TimeGermany, Remote R20d ago
-
C# | C++ | CPU architecture | CUDA | Deep learningSenior-level Full TimeUS, CA, Santa Clara, United States21d ago
-
Senior-level Full TimeUS, CA, Santa Clara, United States21d ago
-
APIs | Compiler infrastructure | Device Drivers | Firmware | Hardware Aware TechniquesExecutive-level Full TimeHerzliya, Israel, IL21d ago
-
Senior Machine Learning Engineer, Runtime and Serving USD 213K-263KBenchmarking | Buffer management | C++ | CUDA | Concurrent SystemsSenior-level Full TimeMountain View, CA, USA21d ago
-
Senior Deep Learning Compiler Verification Engineer USD 140K-224KC++ | Formal verification | Graph optimization | IR lowering | JAXComprehensive benefits package | EquitySenior-level Full TimeUS, CA, Santa Clara, United States22d ago
-
SOC Architect, XProf USD 147K-211KC# | C++ | Compiler profiling | Data Analysis | Data VisualizationSenior-level Full TimeSunnyvale, CA, USA22d ago
-
Machine Learning Engineer, Runtime & Optimization USD 213K-263KC++ | CUDA | Deep learning | JAX | Machine LearningCompany benefits program | Discretionary annual bonus | Equity incentive planSenior-level Full TimeMountain View, California, USA27d ago
-
ML Runtime Optimization Engineer USD 159K-199KCPU | CUDA | Deep learning | Embedded Systems | GPU401k match | Dental insurance | Disability insurance | Health insurance | Learning stipendSenior-level Full TimeSunnyvale, California, United States27d ago
-
Agentic AI | Autonomous Driving | Compiler technology | Curriculum Design | Deep learningMid-level Full TimeUS, CA, Santa Clara, United States29d ago
-
Autotuning | Benchmarking | C++ | CUDA | Code generationSenior-level Full TimeSunnyvale, CA, USA1mo ago
-
ARM Mali | Android | Apple Neural Engine | C++ | CoreMLCommute subsidy | Employee assistance program | Employee resource groups | Employee stock ownership | Generous vacation and personal daysSenior-level Full TimeMountain View, CA, USA1mo ago
-
Senior Software Engineer, Machine Learning, Core ML USD 174K-252KC++ | Compiler optimization | Data Processing | Data parallelism | DebuggingSenior-level Full TimeMountain View, CA, USA1mo ago
-
Principal Deep Learning Communication Architect USD 272K-431K3D Parallelism | CUDA | Context Parallelism | Data parallelism | DeepSpeedSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Senior Software Engineer, LLM Performance USD 180K-339KC++ | CUDA | Cutlass | FlashAttention | FlashInferSenior-level Full TimeSF Bay Area (Hybrid) R1mo ago
-
Compiler Engineer - Machine Learning Compiler USD 170K-251KBackend Development | C++ | Code generation | Compiler Intermediate Representation | Constraint ProgrammingMid-level Full TimePalo Alto, CA1mo ago