AI Research Engineer (Model Compression & Quantization)
Tasks
- Address production inference bottlenecks
- Analyze accuracy, latency, and memory trade-offs
- Apply low-bit quantization for generative AI
- Author technical papers for peer-reviewed conferences
- Build compression pipelines
- Document methodologies, experiments, and results
- Establish performance and fidelity metrics
- Implement pruning to remove redundant parameters
- Use knowledge distillation to train smaller student models
- Research mixed-precision quantization and advanced compression strategies
Skills/Tech-stack
Backpropagation | C++ | Fine-tuning | Knowledge Distillation | Mixed Precision | Model Pruning | Neural Networks | Optimization | Post-training | Post-training Quantization | PyTorch | Python | Quantization | Quantization-aware Training | Transformers