Find jobs in AI/ML, Data Science and Big Data
20 results
for FP8
(Skill/Tech stack)
-
AWQ | AWS | Batching | CPU architecture | CUDASenior-level Full TimeGuangzhou, Guangdong, China3d ago
-
Algorithm Engineer - Deep Learning USD 136K-231KC++ | Computer Vision | Deep learning | FP16 | FP8401k match | Dental insurance | Employee assistance program | Employee stock purchase program | Life insuranceSenior-level Full TimeUSA-CA-Milpitas-KLA, United States4d ago
-
Applied Scientist 5 INR 2475K-4500K3D Reconstruction | Adapters | CLIP | Computer Vision | ControlNetSenior-level Full TimeBangalore, India R4d ago
-
AWS Trainium | BF16 | Deep learning | Distributed Training | FP8Entry-level Full TimeCupertino, California, USA6d ago
-
Artificial Intelligence | Bottleneck analysis | CUDA | Deep learning | Diffusion ModelsBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States7d ago
-
Mid-level Full Time北京 R8d ago
-
具身世界模型推理INFRA工程师 - XiaomiRobotics CNY 240K-480KCFG Parallelism | Diffusion Models | Expert parallelism | FP8 | Machine LearningSenior-level Full Time北京8d ago
-
AWQ | AWS | Accelerate | Azure | BatchingMid-level Full TimeShenzhen, Guangdong, China R19d ago
-
Senior Software Engineer, CUDA Deep Learning Systems USD 184K-356KC++ | CUDA | CUDA kernel | CUDA kernel optimization | Computer ArchitectureEquity | Health benefits | Paid time offSenior-level Full TimeUS, CA, Santa Clara, United States25d ago
-
AWQ | Audio codecs | Audio streaming | Autoscaling | Chunked prefill401k matching | Annual offsites | Dental coverage | Employer-paid training | Healthcare benefitsMid-level Full TimeSan Francisco, CA1mo ago
-
Staff Software Engineer, Inference USD 188K-275KBF16 | C++ | CUDA | Distributed Systems | FP8401k employer match | Dental insurance | Employee stock purchase program | Flexible PTO | Flexible spending accountSenior-level Full TimeSunnyvale, CA / Bellevue, WA1mo ago
-
AI Engineer - Model Performance USD 165K-250KAttention Backend | Audio Processing | Batching | CUDA | CUDA graphAsync communication | Innovation-focused culture | Remote work | Startup environment | Supportive teamMid-level Full TimeSF Hybrid R1mo ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Distributed Systems | FP8 | FasterTransformer | Flash AttentionOn-site workEntry-level Full Time InternshipCHN - Minhang, China1mo ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batchingOn-site workEntry-level Full Time InternshipCHN - Minhang, China1mo ago
-
C++ | CUDA | CUDA profiling | Collective communication | Communication Compute OverlapSenior-level Full TimeIsrael, Tel Aviv R1mo ago
-
Tech Lead, Robotic AI Model USD 150K-180KAction Chunking | Action Tokenization | Behavior Cloning | DPO | DeepSpeedSenior-level Full TimeEl Segundo, California, United States1mo ago
-
Senior Deep Learning Algorithms Engineer - BioNeMo USD 180K-333KApplied Machine Learning | C++ | CUDA | CUDA kernels | Deep learningSenior-level Full TimeVietnam, Ho Chi Minh City1mo ago
-
Debugging | Deep learning | Distributed Systems | FP8 | GPUComprehensive benefits package | Hybrid work modelSenior-level Full TimeIsrael, Yokneam1mo ago
-
AI Inference Engineer - Model Optimization & Deployment USD 205K-303KAccuracy evaluation | BF16 | C++ | CUDA | CUDA kernelsSenior-level Full TimeFoster City, CA1mo ago
-
Senior Solutions Architect II - AI/ML USD 150K-200KArtificial Intelligence | CI/CD | CUDA | CUDA toolkit | ChatbotsConference reimbursement | Employee assistance program | Flexible time off | LinkedIn Learning access | Remote workMid-level Full TimeSan Francisco R1mo ago