AI Research Engineer (Kernel & Inference Optimization) - 100% Remote Worldwide
Tasks
- Build inference evaluation pipelines
- Design model serving architectures
- Identify and resolve inference bottlenecks
- Implement distributed inference parallelism
- Monitor inference performance metrics
- Optimize GPU kernel performance
- Optimize inference strategies
- Prepare test datasets and simulation scenarios
Perks/Benefits
Skills/Tech-stack
Diffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash Attention | GPU Kernels | High Throughput | Inference Optimization | Inference Systems | KV cache | Language Processing | Low Latency | Machine Learning | Memory Optimization | Model Serving | Model architectures | Natural Language | Natural Language Processing | Performance Metrics | Pipeline parallelism | Pruning | Quantization | Speculative decoding | Tensor Parallelism | Vision Transformers
Education
Roles
Related jobs
-
AI Research Engineer GBP 110K-200KC# | CUDA | Deep learning | Machine Learning | PyTorchHybrid Remote | Remote Interview AccommodationMid-level Full TimeHybrid (UK) R3h ago
-
AWS | Apache Airflow | Azure | CI/CD | Data EngineeringCareer growth opportunities | Continuous learning | Flexible working hours | Fully remote | Home office setup supportSenior-level Full TimeBrazil R4h ago
-
Cloud Computing | Data Pipelines | Debugging | Deployment | ETLCareer growth opportunities | Continuous learning culture | Coworking access | Flexible schedule | Fully remoteMid-level Full TimeNetherlands R5h ago
-
Cloud Computing | Data Pipelines | ETL | Google Colab | Information RetrievalCareer growth opportunities | Continuous learning culture | Coworking access | Flexible schedule | Fully remote workMid-level Full TimeIreland R5h ago
-
Cloud Computing | ETL | Google Colab | Information Retrieval | Jupyter NotebooksCareer growth opportunities | Coworking access | Employee benefits | Flexible schedule | Fully remote workMid-level Full TimeSwitzerland R5h ago
-
Cloud Computing | Data pipeline | Debugging | ETL | Google ColabCareer growth | Continuous learning | Flexible work hours | Fully remote | International collaborationMid-level Full TimeFrance R5h ago
-
Cloud Computing | Data Pipelines | Debugging | ETL | Google ColabCareer growth opportunities | Flexible work schedule | Fully remote | Inclusive culture | Optional coworking accessMid-level Full TimeSpain R5h ago
-
Cloud Computing | Data Pipelines | ETL | Google Colab | Information RetrievalCareer growth | Continuous learning culture | Coworking access | Flexible schedule | Fully remote workMid-level Full TimeBrazil R6h ago
-
Cloud infrastructure | Data Pipelines | Debugging | ETL | Google ColabCareer growth opportunities | Continuous learning opportunities | Coworking access | Flexible work hours | Fully remoteMid-level Full TimeGermany R6h ago
-
AI Research Engineer - Applied AI INR 2000K-3000KAPI Design | AWS SageMaker | Anomaly Detection | Azure Machine Learning | Bias auditingAsynchronous culture | Distributed team | Remote workMid-level Full TimeRemote - REMOTE, India, India R10h ago
-
AI Solution Strategist USD 125K-188KArtificial Intelligence | Conversational Design | Customer Experience | Customer Success | Customer experience strategyMid-level Full TimeUSA - Remote R11h ago
-
Senior Machine Learning Engineer, Model Training & Evaluation INR 2500K-4500KBenchmarking | Checkpointing | DeepSpeed | Distributed Training | Experiment trackingAccidental insurance | Flexible hours | Hybrid work | Life insurance | Medical insuranceSenior-level Full TimeBangalore, India (Hybrid) R11h ago
-
Sr. Data Engineer II (6516) USD 152K-188KAWS | Apache NiFi | Apache Spark | Cloudera | Data Architecture401k match | Dependent care | Employee Assistance and Wellness Programs | Flexible work arrangements | Health, dental, vision insuranceMid-level Full TimeRemote R14h ago
-
Pessoa Engenheira de Dados Senior BRL 18K-18KAWS Glue | AWS Lake Formation | Amazon Athena | Amazon DynamoDB | Amazon EMRCollaborative work environment | Innovation culture | Mentoring | Professional growthSenior-level Full TimeRemoto R15h ago
-
Senior Software Engineer for AI USD 149K-208KAWS | Anthropic Claude | Cloud infrastructure | Code Reviews | Data PrivacySenior-level Full TimeRemote- United States R17h ago
-
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism100 percent remoteSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KComputer Vision | Diffusion Models | Edge Computing | Expert parallelism | Flash AttentionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionRemote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication requirement | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KCompute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelismEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 206K-327KBackpropagation | C++ | Knowledge Distillation | Mixed Precision | Model PruningFlexible collaboration | Remote workSenior-level Full TimeRemote job R18h ago
-
AI Research Engineer (Model Compression & Quantization) USD 210K-330KBackpropagation | C++ | Fine Tuning | Knowledge Distillation | Mixed PrecisionEnglish communication support | Remote workSenior-level Full TimeRemote job R18h ago
-
Principal AI Engineer USD 167K-242KAWS | Anthropic API | Artificial Intelligence | Continuous integration | DatabricksSenior-level Full TimeRemote R18h ago
-
Product Manager (AI/ML) INR 1068K-2000KAI | Accuracy | Agile | Backlog Management | ClassificationEquity | Family insurance coverage | Flexible work hours | Health teleconsultations | Hybrid work setupMid-level Full TimeHybrid - Bangalore, India R19h ago