AI Research Engineer (Model Compression & Quantization)
Tasks
- Address bottlenecks in production inference on edge devices
- Analyze accuracy latency and memory trade-offs
- Apply low bit quantization to reduce model size and latency
- Author technical papers for top tier conferences
- Build compression pipelines and metrics for performance and fidelity
- Document experiments and results for reproducibility
- Implement pruning to remove redundant parameters
- Leverage knowledge distillation to compress large models
- Research mixed precision quantization and advanced compression strategies
Perks/Benefits
Skills/Tech-stack
Backpropagation | C++ | Knowledge Distillation | Mixed Precision | Model Pruning | Neural Networks | Optimization | Post-training | Post-training Quantization | PyTorch | Quantization | Quantization aware training | Transformers
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Roles
Related jobs
-
AI Research Engineer GBP 110K-200KC# | CUDA | Deep learning | Machine Learning | PyTorchHybrid Remote | Remote Interview AccommodationMid-level Full TimeHybrid (UK) R3h ago
-
Cloud Computing | Data Pipelines | ETL | Google Colab | Information RetrievalCareer growth opportunities | Continuous learning culture | Coworking access | Flexible schedule | Fully remote workMid-level Full TimeIreland R5h ago
-
AI Research Engineer - Applied AI INR 2000K-3000KAPI Design | AWS SageMaker | Anomaly Detection | Azure Machine Learning | Bias auditingAsynchronous culture | Distributed team | Remote workMid-level Full TimeRemote - REMOTE, India, India R10h ago
-
Senior Machine Learning Engineer, Model Training & Evaluation INR 2500K-4500KBenchmarking | Checkpointing | DeepSpeed | Distributed Training | Experiment trackingAccidental insurance | Flexible hours | Hybrid work | Life insurance | Medical insuranceSenior-level Full TimeBangalore, India (Hybrid) R11h ago
-
Senior Software Engineer USD 140K-185KAWS | Automated testing | Azure | C++ | Git401K company matching | Dental insurance | Dependent care benefits | Flexible spending account | Health insuranceSenior-level Full TimeBoulder, CO R15h ago
-
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism100 percent remoteSenior-level Full TimeRemote job R17h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KComputer Vision | Diffusion Models | Edge Computing | Expert parallelism | Flash AttentionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | Finetuning | GenerativeAI | KnowledgeDistillation | MixedPrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication requirement | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KCompute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelismEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
Senior AI Engineer - RAG & AI Agents EUR 56K-82KAI Agents | AWS | CI/CD | ChromaDB | CouchbaseAnnual learning budget | Equipment budget | Fitness subscription | Flexible hours | Health insuranceSenior-level Full TimeRemote R18h ago
-
Machine Learning Engineer USD 150K-215KData Augmentation | Deep learning | Isaac | Loss Functions | Medical ImagingMid-level Full TimeSan Francisco (hybrid) R19h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote work flexibility | Research publication supportSenior-level Full TimeRemote job R20h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R20h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionPublication opportunities | Remote workSenior-level Full TimeRemote job R20h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KDiffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Expert parallelismRemote workSenior-level Full TimeRemote job R20h ago
-
Diffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash Attention100 percent remote | Worldwide remoteSenior-level Full TimeRemote job R20h ago
-
Mid-level Full TimeSlovenia / Remote R22h ago
-
Member of Engineering (Pre-training / Data Research) USD 160K-300KCurriculum learning | Data Ablation | Data Curation | Data Pipelines | Data mixingCompany-provided equipment | Flexible hours | Frequent team get togethers | Fully remote work | Health insurance allowanceMid-level Full TimeRemote (EMEA/East Coast) R22h ago
-
Benchmarking | Data Balancing | Data Filtering | Dataset curation | Distributed TrainingRemote work worldwideSenior-level Full TimeRemote job R22h ago
-
Audio Processing | Autoregression | Autoregressive models | Computer Vision | Deep learningRemote workSenior-level Full TimeRemote job R22h ago