Find jobs in AI/ML, Data Science and Big Data
8 results
for Speculative decoding
(Skill/Tech stack)
-
Mid-level Full TimeBeijing, Beijing, CN; Suzhou, Jiangsu, CN2d ago
-
Machine Learning Engineer CAD 123K-192KDeep learning | Gradient descent | Graph theory | Language Models | Large Language ModelsMid-level Full TimeToronto - MSO, Canada R3d ago
-
Member of Technical Staff - Inference USD 200K-300KAWS | Ansible | Benchmarks | C++ | CUDACompetitive compensation | Conference attendance | Equity incentives | Flexible work | Professional developmentSenior-level Full TimeRemote R6d ago
-
Software Engineer, Inference Platform USD 200K-250KCUDA | Distributed Systems | Expert parallelism | GPU Compute | GPU OptimizationDental insurance | Equity | Health insurance | PTO policy | Retirement planMid-level Full TimeSan Francisco, CA15d ago
-
Adaptive compute | Applied Mathematics | C# | C++ | Computer VisionHigh-impact projects | Research publication opportunities | Team culture developmentEntry-level InternshipMenlo Park, CA21d ago
-
Solutions Architect, Inference Deployments USD 152K-241KAI Inference | AI inference workloads | Disaggregated inference | GPU Operator | GPU OrchestrationBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States22d ago
-
Staff Software Engineer, ML Infrastructure USD 300K-430KBatching Strategies | Distributed Training | Fault Tolerance | Inference architecture | JAXDental | Lunches | Medical | Snacks | VacationSenior-level Full TimeSan Francisco24d ago
-
Research Engineer, Core ML USD 200K-280KAPIs | Backend Development | Deep learning | Distributed Systems | FasterTransformerCompetitive benefits | Health insurance | Startup equitySenior-level Full TimeSan Francisco30d ago