AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Build and monitor inference tests
- Create test datasets and simulation scenarios
- Deploy inference pipelines
- Design model serving architectures
- Identify and resolve serving bottlenecks
- Integrate inference frameworks into production pipelines
- Optimize inference strategies
- Track performance metrics
Perks/Benefits
Skills/Tech-stack
Computer Vision | Diffusion Models | Edge Computing | Expert parallelism | Flash Attention | GPU Kernels | Inference Optimization | KV cache | Level optimization | Low-level optimization | Machine Learning | Memory Management | Mobile optimization | Model Serving | NLP | Neural Networks | On-device Inference | Pipeline parallelism | Pruning | Quantization | Speculative decoding | Tensor Parallelism | Vision Transformers
Education
Related jobs
-
AI Research Engineer GBP 110K-200KC# | CUDA | Deep learning | Machine Learning | PyTorchHybrid Remote | Remote Interview AccommodationMid-level Full TimeHybrid (UK) R3h ago
-
AWS | Apache Airflow | Azure | CI/CD | Data EngineeringCareer growth opportunities | Continuous learning | Flexible working hours | Fully remote | Home office setup supportSenior-level Full TimeBrazil R4h ago
-
Cloud Computing | Data Pipelines | Debugging | Deployment | ETLCareer growth opportunities | Continuous learning culture | Coworking access | Flexible schedule | Fully remoteMid-level Full TimeNetherlands R5h ago
-
Cloud Computing | Data Pipelines | ETL | Google Colab | Information RetrievalCareer growth opportunities | Continuous learning culture | Coworking access | Flexible schedule | Fully remote workMid-level Full TimeIreland R5h ago
-
Cloud Computing | ETL | Google Colab | Information Retrieval | Jupyter NotebooksCareer growth opportunities | Coworking access | Employee benefits | Flexible schedule | Fully remote workMid-level Full TimeSwitzerland R5h ago
-
Cloud Computing | Data pipeline | Debugging | ETL | Google ColabCareer growth | Continuous learning | Flexible work hours | Fully remote | International collaborationMid-level Full TimeFrance R5h ago
-
Cloud Computing | Data Pipelines | Debugging | ETL | Google ColabCareer growth opportunities | Flexible work schedule | Fully remote | Inclusive culture | Optional coworking accessMid-level Full TimeSpain R5h ago
-
Cloud Computing | Data Pipelines | ETL | Google Colab | Information RetrievalCareer growth | Continuous learning culture | Coworking access | Flexible schedule | Fully remote workMid-level Full TimeBrazil R5h ago
-
Cloud infrastructure | Data Pipelines | Debugging | ETL | Google ColabCareer growth opportunities | Continuous learning opportunities | Coworking access | Flexible work hours | Fully remoteMid-level Full TimeGermany R5h ago
-
AI Solution Strategist USD 125K-188KArtificial Intelligence | Conversational Design | Customer Experience | Customer Success | Customer experience strategyMid-level Full TimeUSA - Remote R10h ago
-
Senior Machine Learning Engineer, Model Training & Evaluation INR 2500K-4500KBenchmarking | Checkpointing | DeepSpeed | Distributed Training | Experiment trackingAccidental insurance | Flexible hours | Hybrid work | Life insurance | Medical insuranceSenior-level Full TimeBangalore, India (Hybrid) R10h ago
-
Sr. Data Engineer II (6516) USD 152K-188KAWS | Apache NiFi | Apache Spark | Cloudera | Data Architecture401k match | Dependent care | Employee Assistance and Wellness Programs | Flexible work arrangements | Health, dental, vision insuranceMid-level Full TimeRemote R13h ago
-
Pessoa Engenheira de Dados Senior BRL 18K-18KAWS Glue | AWS Lake Formation | Amazon Athena | Amazon DynamoDB | Amazon EMRCollaborative work environment | Innovation culture | Mentoring | Professional growthSenior-level Full TimeRemoto R14h ago
-
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism100 percent remoteSenior-level Full TimeRemote job R17h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R17h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R17h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication requirement | Remote workSenior-level Full TimeRemote job R17h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KCompute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelismEnglish communication support | Remote workSenior-level Full TimeRemote job R17h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Knowledge Distillation | Mixed Precision | Model PruningFlexible collaboration | Remote workSenior-level Full TimeRemote job R17h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication support | Remote workSenior-level Full TimeRemote job R17h ago
-
Principal AI Engineer USD 167K-242KAWS | Anthropic API | Artificial Intelligence | Continuous integration | DatabricksSenior-level Full TimeRemote R17h ago
-
Senior AI Engineer - RAG & AI Agents EUR 56K-82KAI Agents | AWS | CI/CD | ChromaDB | CouchbaseAnnual learning budget | Equipment budget | Fitness subscription | Flexible hours | Health insuranceSenior-level Full TimeRemote R17h ago
-
Product Manager (AI/ML) INR 1068K-2000KAI | Accuracy | Agile | Backlog Management | ClassificationEquity | Family insurance coverage | Flexible work hours | Health teleconsultations | Hybrid work setupMid-level Full TimeHybrid - Bangalore, India R18h ago
-
AI Agents | AWS | Apache Spark | Artificial Intelligence | Big DataFixed term contract to FTE conversion | Hybrid scheduleSenior-level Full TimeAmsterdam, Netherlands R18h ago
-
Senior Data Engineer GBP 70K-80KData Modeling | Data Pipelines | Data Preprocessing | Feature Engineering | Machine LearningAnnual leave | Enhanced parental leave | Flexible working | Hardware allowance | Learning and development budgetSenior-level Full TimeUK (Remote) R19h ago