AI Research Engineer (Model Compression & Quantization)
Tasks
- Apply pruning to reduce parameters and compute
- Apply quantization to reduce model size and latency
- Build compression pipelines
- Define performance and fidelity metrics
- Develop model compression strategies
- Document experiments and results
- Implement knowledge distillation for smaller multimodal models
- Optimize inference bottlenecks in production
- Publish technical papers
- Research mixed precision quantization methods
Perks/Benefits
Skills/Tech-stack
Backpropagation | Finetuning | GenerativeAI | KnowledgeDistillation | MixedPrecision | ModelPruning | MultimodalAI | NeuralNetworkArchitectures | Optimization | PostTrainingQuantization | PyTorch | QuantizationAwareTraining | Transformers
Education
Roles
Related jobs
-
AI Research Engineer GBP 110K-200KC# | CUDA | Deep learning | Machine Learning | PyTorchHybrid Remote | Remote Interview AccommodationMid-level Full TimeHybrid (UK) R3h ago
-
Cloud Computing | Data Pipelines | ETL | Google Colab | Information RetrievalCareer growth opportunities | Continuous learning culture | Coworking access | Flexible schedule | Fully remote workMid-level Full TimeIreland R5h ago
-
AI Research Engineer - Applied AI INR 2000K-3000KAPI Design | AWS SageMaker | Anomaly Detection | Azure Machine Learning | Bias auditingAsynchronous culture | Distributed team | Remote workMid-level Full TimeRemote - REMOTE, India, India R10h ago
-
Senior Machine Learning Engineer, Model Training & Evaluation INR 2500K-4500KBenchmarking | Checkpointing | DeepSpeed | Distributed Training | Experiment trackingAccidental insurance | Flexible hours | Hybrid work | Life insurance | Medical insuranceSenior-level Full TimeBangalore, India (Hybrid) R11h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication requirement | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Knowledge Distillation | Mixed Precision | Model PruningFlexible collaboration | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
Senior AI Engineer - RAG & AI Agents EUR 56K-82KAI Agents | AWS | CI/CD | ChromaDB | CouchbaseAnnual learning budget | Equipment budget | Fitness subscription | Flexible hours | Health insuranceSenior-level Full TimeRemote R18h ago
-
Machine Learning Engineer USD 150K-215KData Augmentation | Deep learning | Isaac | Loss Functions | Medical ImagingMid-level Full TimeSan Francisco (hybrid) R19h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote work flexibility | Research publication supportSenior-level Full TimeRemote job R20h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R20h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionPublication opportunities | Remote workSenior-level Full TimeRemote job R20h ago
-
Member of Engineering (Pre-training / Data Research) USD 160K-300KCurriculum learning | Data Ablation | Data Curation | Data Pipelines | Data mixingCompany-provided equipment | Flexible hours | Frequent team get togethers | Fully remote work | Health insurance allowanceMid-level Full TimeRemote (EMEA/East Coast) R22h ago
-
Audio Processing | Autoregression | Autoregressive models | Computer Vision | Deep learningRemote workSenior-level Full TimeRemote job R22h ago
-
AI Scientist DKK 499K-734KApache Spark | Azure | Databricks | Deep learning | Delta LakeBusiness resource groups | Charitable donation stipend | Flexible work hours | Health stipend | Paid time offMid-level Full TimeCopenhagen R23h ago
-
AI Engineer (GCP) PLN 250K-384KAgile | CI/CD | Computer Vision | Databricks | DockerCareer growth | Conference support | Flexible work hours | Integration budget | Medical insuranceSenior-level Full TimeRemote job R1d ago
-
Data Scientist / AI/ML Engineer (Imagery) VAWFH 1652 USD 153K-207KAccuracy | Computer Vision | Containerization | Data Cleansing | Data PreprocessingSenior-level Full TimeReston, VA R1d ago
-
AI Software Engineer II (R-18544) EUR 52K-68KAPI Development | Agentic Workflows | CI/CD | FastAPI | GCPEducational assistance program | Employee Health Insurance | Family-friendly leave | Flexible working (hybrid model) | Holiday buy & sellMid-level Full TimeDublin - Ireland R1d ago
-
AWS | Access Control | Airflow | Amazon Redshift | AuthenticationFlexible work hours | Remote workSenior-level Full TimePortugal - Remote R1d ago
-
API Security | Access Control | Airflow | Amazon Redshift | AuthenticationFlexible working hours | Remote workSenior-level Full TimeSouth Africa - Remote R1d ago
-
Senior Principal Machine Learning Engineer, vLLM USD 206K-351KCPU architecture | Code review | Computer Vision | Deep learning | GPU Architecture401k employer match | Employee stock purchase plan | Flexible spending account | Health savings account | Paid parental leaveSenior-level Full TimeBoston, United States R1d ago
-
Senior Machine Learning Engineer USD 174K-287KComputer Vision | Deep learning | Gradient optimization | Graph theory | Inference OptimizationPaid parental leave | Paid time offSenior-level Full TimeBoston, United States R1d ago