AI Research Engineer (Model Compression & Quantization)
Tasks
- Address production inference bottlenecks
- Analyze efficiency accuracy tradeoffs
- Apply low bit quantization
- Build compression pipelines
- Document methodologies experiments and results
- Establish performance and fidelity metrics
- Implement pruning techniques
- Leverage knowledge distillation
- Publish research papers
- Research mixed precision quantization
Perks/Benefits
Skills/Tech-stack
Backpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed Precision | Model Pruning | Neural Networks | Post-training | Post-training Quantization | PyTorch | Quantization | Quantization aware training | Transformers
Education
Related jobs
-
AI Research Engineer GBP 110K-200KC# | CUDA | Deep learning | Machine Learning | PyTorchHybrid Remote | Remote Interview AccommodationMid-level Full TimeHybrid (UK) R4h ago
-
Software Engineer - Storage MXN 934K-1260KAI Assisted Development | AI-Assisted Development Tools | Architecture | Backend Development | C#Senior-level Full TimeRemote - Argentina; Remote - Colombia … R10h ago
-
AI Research Engineer - Applied AI INR 2000K-3000KAPI Design | AWS SageMaker | Anomaly Detection | Azure Machine Learning | Bias auditingAsynchronous culture | Distributed team | Remote workMid-level Full TimeRemote - REMOTE, India, India R10h ago
-
Computer Vision | Deep learning | Docker | GPU Computing | Inference ServerActive lifestyle reimbursement | Continuous learning bonus | Health care reimbursement | Home office reimbursement | Paid time offMid-level Full TimeMexico - Remote R10h ago
-
Senior Machine Learning Engineer, Model Training & Evaluation INR 2500K-4500KBenchmarking | Checkpointing | DeepSpeed | Distributed Training | Experiment trackingAccidental insurance | Flexible hours | Hybrid work | Life insurance | Medical insuranceSenior-level Full TimeBangalore, India (Hybrid) R12h ago
-
Senior Software Engineer USD 140K-185KAWS | Automated testing | Azure | C++ | Git401K company matching | Dental insurance | Dependent care benefits | Flexible spending account | Health insuranceSenior-level Full TimeBoulder, CO R15h ago
-
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism100 percent remoteSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KComputer Vision | Diffusion Models | Edge Computing | Expert parallelism | Flash AttentionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | Finetuning | GenerativeAI | KnowledgeDistillation | MixedPrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication requirement | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KCompute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelismEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Knowledge Distillation | Mixed Precision | Model PruningFlexible collaboration | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
Senior AI Engineer - RAG & AI Agents EUR 56K-82KAI Agents | AWS | CI/CD | ChromaDB | CouchbaseAnnual learning budget | Equipment budget | Fitness subscription | Flexible hours | Health insuranceSenior-level Full TimeRemote R19h ago
-
Machine Learning Engineer USD 150K-215KData Augmentation | Deep learning | Isaac | Loss Functions | Medical ImagingMid-level Full TimeSan Francisco (hybrid) R20h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R21h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionPublication opportunities | Remote workSenior-level Full TimeRemote job R21h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KDiffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Expert parallelismRemote workSenior-level Full TimeRemote job R21h ago
-
Diffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash Attention100 percent remote | Worldwide remoteSenior-level Full TimeRemote job R21h ago
-
Mid-level Full TimeSlovenia / Remote R23h ago
-
Member of Engineering (Pre-training / Data Research) USD 160K-300KCurriculum learning | Data Ablation | Data Curation | Data Pipelines | Data mixingCompany-provided equipment | Flexible hours | Frequent team get togethers | Fully remote work | Health insurance allowanceMid-level Full TimeRemote (EMEA/East Coast) R23h ago
-
Benchmarking | Data Balancing | Data Filtering | Dataset curation | Distributed TrainingRemote work worldwideSenior-level Full TimeRemote job R23h ago