AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Analyze computational efficiency
- Create test datasets and simulation scenarios
- Design model serving architectures
- Develop inference algorithms
- Establish performance metrics
- Identify and resolve bottlenecks
- Implement inference pipelines
- Integrate serving frameworks into edge pipelines
- Monitor production inference KPIs
- Optimize inference strategies
Skills/Tech-stack
Diffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash Attention | GPU Kernels | GPU Programming | Inference Optimization | Inference Systems | KV cache | Language Processing | Machine Learning | Metal Shading Language | Mobile GPU | Mobile GPU Programming | Model Serving | Natural Language | Natural Language Processing | Pipeline parallelism | Pruning | Quantization | Shader Programming | Shading language | Speculative decoding | Tensor Parallelism | Vision Transformers