AI Research Engineer (Model Compression & Quantization) - 100% Remote Worldwide
Tasks
- Build and monitor inference tests in simulated and live environments
- Create test datasets and simulation scenarios for low resource devices
- Design model serving architectures for low latency high throughput
- Diagnose and resolve inference bottlenecks in production
- Establish performance metrics and evaluation frameworks
- Integrate inference frameworks into edge and on device production pipelines
- Optimize memory usage for inference pipelines
Perks/Benefits
Skills/Tech-stack
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism | Flash Attention | GPU Kernels | High Throughput | Inference Optimization | Inference Pipelines | KV cache | Low Latency | Machine Learning | Memory Optimization | Metal Shading Language | Mobile Devices | Model Compression | Model Pruning | Model Quantization | Model Serving | NLP | Pipeline parallelism | Shading language | Speculative decoding | Tensor Parallelism | Vision Transformers
Education
Roles
Related jobs
-
AI Research Engineer GBP 110K-200KC# | CUDA | Deep learning | Machine Learning | PyTorchHybrid Remote | Remote Interview AccommodationMid-level Full TimeHybrid (UK) R4h ago
-
AWS | Apache Airflow | Azure | CI/CD | Data EngineeringCareer growth opportunities | Continuous learning | Flexible working hours | Fully remote | Home office setup supportSenior-level Full TimeBrazil R5h ago
-
Cloud Computing | Data Pipelines | Debugging | Deployment | ETLCareer growth opportunities | Continuous learning culture | Coworking access | Flexible schedule | Fully remoteMid-level Full TimeNetherlands R5h ago
-
Cloud Computing | Data Pipelines | ETL | Google Colab | Information RetrievalCareer growth opportunities | Continuous learning culture | Coworking access | Flexible schedule | Fully remote workMid-level Full TimeIreland R5h ago
-
Cloud Computing | ETL | Google Colab | Information Retrieval | Jupyter NotebooksCareer growth opportunities | Coworking access | Employee benefits | Flexible schedule | Fully remote workMid-level Full TimeSwitzerland R5h ago
-
Cloud Computing | Data pipeline | Debugging | ETL | Google ColabCareer growth | Continuous learning | Flexible work hours | Fully remote | International collaborationMid-level Full TimeFrance R5h ago
-
Cloud Computing | Data Pipelines | Debugging | ETL | Google ColabCareer growth opportunities | Flexible work schedule | Fully remote | Inclusive culture | Optional coworking accessMid-level Full TimeSpain R5h ago
-
Cloud Computing | Data Pipelines | ETL | Google Colab | Information RetrievalCareer growth | Continuous learning culture | Coworking access | Flexible schedule | Fully remote workMid-level Full TimeBrazil R6h ago
-
Cloud infrastructure | Data Pipelines | Debugging | ETL | Google ColabCareer growth opportunities | Continuous learning opportunities | Coworking access | Flexible work hours | Fully remoteMid-level Full TimeGermany R6h ago
-
AI Research Engineer - Applied AI INR 2000K-3000KAPI Design | AWS SageMaker | Anomaly Detection | Azure Machine Learning | Bias auditingAsynchronous culture | Distributed team | Remote workMid-level Full TimeRemote - REMOTE, India, India R10h ago
-
AI Solution Strategist USD 125K-188KArtificial Intelligence | Conversational Design | Customer Experience | Customer Success | Customer experience strategyMid-level Full TimeUSA - Remote R11h ago
-
Senior Machine Learning Engineer, Model Training & Evaluation INR 2500K-4500KBenchmarking | Checkpointing | DeepSpeed | Distributed Training | Experiment trackingAccidental insurance | Flexible hours | Hybrid work | Life insurance | Medical insuranceSenior-level Full TimeBangalore, India (Hybrid) R11h ago
-
Sr. Data Engineer II (6516) USD 152K-188KAWS | Apache NiFi | Apache Spark | Cloudera | Data Architecture401k match | Dependent care | Employee Assistance and Wellness Programs | Flexible work arrangements | Health, dental, vision insuranceMid-level Full TimeRemote R14h ago
-
Pessoa Engenheira de Dados Senior BRL 18K-18KAWS Glue | AWS Lake Formation | Amazon Athena | Amazon DynamoDB | Amazon EMRCollaborative work environment | Innovation culture | Mentoring | Professional growthSenior-level Full TimeRemoto R15h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KComputer Vision | Diffusion Models | Edge Computing | Expert parallelism | Flash AttentionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication requirement | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KCompute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelismEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Knowledge Distillation | Mixed Precision | Model PruningFlexible collaboration | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
Principal AI Engineer USD 167K-242KAWS | Anthropic API | Artificial Intelligence | Continuous integration | DatabricksSenior-level Full TimeRemote R18h ago
-
Senior AI Engineer - RAG & AI Agents EUR 56K-82KAI Agents | AWS | CI/CD | ChromaDB | CouchbaseAnnual learning budget | Equipment budget | Fitness subscription | Flexible hours | Health insuranceSenior-level Full TimeRemote R18h ago
-
Product Manager (AI/ML) INR 1068K-2000KAI | Accuracy | Agile | Backlog Management | ClassificationEquity | Family insurance coverage | Flexible work hours | Health teleconsultations | Hybrid work setupMid-level Full TimeHybrid - Bangalore, India R19h ago