Find jobs in AI/ML, Data Science and Big Data
21 results
for Continuous batching
(Skill/Tech stack)
-
AI Performance Optimization Engineer USD 136K-258KC++ | Continuous batching | Deep learning | Distributed Systems | FSDPMid-level Full TimeUnited States - Remote R2d ago
-
AI Performance Optimization Engineer USD 136K-258KC++ | Cache optimization | Continuous batching | Cutlass | Deep learningMid-level Full TimeUnited States - Remote R3d ago
-
AI Performance Optimization Engineer USD 136K-258KAccess patterns | Benchmarking | C++ | Cache optimization | Compiler optimizationFull-time W2 employment | Health benefits | Remote workMid-level Full TimeUnited States - Remote R3d ago
-
AI Performance Optimization Engineer USD 159K-264KC++ | Continuous batching | Cutlass | Deep learning | DeepSpeedRemote workMid-level Full TimeUnited States - Remote R3d ago
-
AI Performance Optimization Engineer USD 136K-258KBenchmarking | C++ | Compiler optimization | Continuous batching | DebuggingMid-level Full TimeUnited States - Remote R3d ago
-
AI Performance Optimization Engineer USD 136K-258KAccess Optimization | Attention Optimization | Benchmarking | C++ | Compiler optimizationMid-level Full TimeUnited States - Remote R4d ago
-
Mid-level Full TimeSeattle (WA), United States5d ago
-
AI Platform Engineer INR 1500K-2500KAutomated Evaluation | CI/CD | CUDA | Continuous Checkpointing | Continuous batchingMid-level Full TimeBangalore, India7d ago
-
AI Platform Engineer INR 1500K-2500KAlerting | CUDA | Cause analysis | Continuous batching | GPU ProfilingMid-level Full TimeBangalore, India8d ago
-
Staff Forward Deployed Engineer USD 195K-239KArtificial Intelligence | Benchmarking | CUDA | CUDA Interconnect | Continuous batchingEmployee assistance program | Flexible time off | Hybrid work | LinkedIn Learning | Local Employee MeetupsSenior-level Full TimeSeattle12d ago
-
Staff Forward Deployed Engineer USD 195K-239KArtificial Intelligence | Benchmarking | CUDA | Continuous batching | CrewAIConference reimbursement | Employee assistance program | Employee stock purchase program | Flexible time off | LinkedIn LearningSenior-level Full TimeSan Francisco R12d ago
-
AWQ | Audio codecs | Audio streaming | Autoscaling | Chunked prefill401k matching | Annual offsites | Dental coverage | Employer-paid training | Healthcare benefitsMid-level Full TimeSan Francisco, CA12d ago
-
Principal Model Optimization Engineer USD 295K-345KCUDA | Continuous batching | GPU | LLM Inference | Machine LearningSenior-level Full TimeSan Mateo, CA, United States R15d ago
-
Senior Machine Learning Engineer, Runtime and Serving USD 213K-263KBenchmarking | Buffer management | C++ | CUDA | Concurrent SystemsSenior-level Full TimeMountain View, CA, USA21d ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batchingOn-site workEntry-level Full Time InternshipCHN - Minhang, China21d ago
-
Member of technical staff (Inference) - Paris EUR 80K-120KC++ | CUDA | Caching | Continuous batching | Distributed ComputingCareer development | Continuous learning | Hybrid work | Professional growthSenior-level Full TimeParis1mo ago
-
Member of technical staff (Inference) - London GBP 230K-325KC++ | CUDA | CUDA kernel | CUDA kernel programming | CachingContinuous learning | Hybrid work | Professional developmentSenior-level Full TimeLondon1mo ago
-
Senior Forward Deployed Engineer (AI/ML) USD 140K-174KBenchmarking | CUDA | Continuous batching | CrewAI | DatabasesConference reimbursement | Employee assistance program | Flexible time off | LinkedIn Learning access | Local Employee MeetupsSenior-level Full TimeSeattle R1mo ago
-
Continuous batching | Jupyter | KV cache | Low Latency | Machine LearningDaily meals | Housing subsidy | Medical, dental & vision coverage | Relocation supportMid-level Full TimeCupertino, CA1mo ago
-
Inference Software Engineer USD 150K-275KC++ | CUDA | Continuous batching | Distributed Systems | KV cacheDaily meals | Housing subsidy | Medical, dental & vision coverage | Relocation supportSenior-level Full TimeCupertino, CA1mo ago
-
AWQ | C++ | CUDA | CUDA kernels | Continuous batchingSenior-level Full TimeDonostia, Spain1mo ago