AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Build inference pipelines
- Design model serving architectures
- Develop evaluation frameworks
- Diagnose serving bottlenecks
- Implement distributed inference techniques
- Integrate inference frameworks into production pipelines
- Monitor performance metrics
- Optimize inference strategies
- Optimize latency and throughput
- Optimize memory usage
- Prepare test datasets and simulation scenarios
- Run inference tests
Skills/Tech-stack
Compute Shaders | Diffusion Models | Distributed Inference | Edge Computing | Expert Parallelism | Flash Attention | GPU Kernels | GPU Clusters | High Throughput | Inference Optimization | KV Cache | Low Latency | Machine Learning | Memory Optimization | Metal Shading Language | Mobile Optimization | Model Serving | NLP | NLP Research | On-device Inference | Pipeline Parallelism | Pruning | Quantization | Shading Language | Speculative Decoding | Tensor Parallelism | Vision Transformers