AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Build inference test harnesses
- Create evaluation datasets and simulations
- Design model serving architectures
- Enable low-latency, high-throughput, scalable inference
- Identify bottlenecks in production
- Implement inference algorithms
- Integrate inference frameworks into production pipelines
- Monitor performance metrics
- Optimize computational efficiency
- Optimize inference pipelines
Skills/Tech-stack
Diffusion Models | Distributed Inference Systems | Edge Computing | Expert Parallelism | Flash Attention | GPU Kernels | Inference Optimization | Inference Systems | KV Cache | Latency Optimization | Low Latency | Low Memory | Machine Learning | Memory Optimization | Mobile Optimization | Model Architecture | Model Serving | Natural Language Processing (NLP) | On-device Inference | Pipeline Parallelism | Pruning | Quantization | Response Optimization | Speculative Decoding | Tensor Parallelism | Throughput Optimization | Token Response Optimization | Vision Transformers
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD