AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Build and monitor inference tests
- Collaborate with cross-functional teams
- Design model serving architectures
- Develop inference algorithms
- Diagnose serving bottlenecks
- Establish performance metrics
- Integrate serving frameworks into production
- Optimize batch processing
- Optimize inference pipelines
- Optimize memory usage
- Prepare test datasets and simulation scenarios
Perks/Benefits
Skills/Tech-stack
Compute Shaders | Diffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Embedded inference | Expert parallelism | Flash Attention | GPU Kernels | High Throughput | Inference Optimization | Inference Systems | KV cache | Low Latency | Machine Learning | Memory Optimization | Metal Shading Language | Mobile optimization | Model Serving | NLP | Pipeline parallelism | Pruning | Python | Quantization | Shading language | Speculative decoding | Tensor Parallelism | Tensor Processing Units | Tensor processing | Vision Transformers
Education
Roles
Related jobs
-
Lead Database Engineer INR 1800K-3000KASH | AWR | AWS CloudWatch | AWS RDS | Access GovernanceHealth and life insurance | Hybrid schedule | Paid time off | Pension and retirement benefits | Professional development supportSenior-level Full TimeHyderabad, India R8h ago
-
Senior Data Engineer PHP 80K-160KApache Airflow | Apache Spark | BigQuery | Cloud platform | Data ModelingAnnual leave | Birthday leave | Flexible work arrangement | Hybrid work arrangement | Learning leaveSenior-level Full TimeTaguig, Metro Manila, Philippines R17h ago
-
Machine Learning Lead Analyst - HIH - Evernorth INR 2500K-4500KAPI Integration | AWS | Algorithms | C# | Cloud platformFull-time employment | Remote work flexibilitySenior-level Full TimeHIH - Hyderabad, India R17h ago
-
Machine Learning/ AI Engineer EUR 60K-76KA/B | A/B Testing | APIs | AWS Lambda | Amazon S3Flexible working hours | Free Lunches | Free breakfast | Free fruit | Free snacksMid-level Full TimeMadrid - Busuu, Spain R17h ago
-
Data Engineer, Fraud EUR 98K-98KApache Airflow | Apache Superset | CI/CD | ClickHouse | DBTFlexible work hours | Global co-working access | Retention bonus scheme | Team events and gatherings | Workstation providedSenior-level Full TimeLisbon, Lisbon, Portugal - Remote R17h ago
-
Senior Engineer, Customer Data Operations INR 3000K-4000KAWS | Airflow | Anomaly Detection | Apache Spark | CI/CDESG initiatives | Employee incentive programs | Flexible vacation | Headspace app access | Hybrid work modelSenior-level Full TimeIndia, Bengaluru, Karnataka R17h ago
-
Senior Systems Engineer, Storage - DGX Cloud USD 208K-414KAlerting | Algorithms | Ansible | Argo CD | CI/CDSenior-level Full TimeUS, CA, Remote, United States R17h ago
-
Machine Learning Engineer, Applied AI (Hybrid) MXN 972K-1215KAWS | Argo CD | Argo Workflows | CI/CD | Data AnalysisEmployee resource groups | Flexible time off | Hybrid work schedule | Meal reimbursement | Paid HolidaysMid-level Full TimeMexico City R22h ago
-
Machine Learning Engineer USD 140K-220KApache Spark | Azure | Azure Machine Learning | CI/CD | Cloud StorageCareer development opportunities | High responsibilitySenior-level Full TimeUnited States - Remote R1d ago
-
AI Services | Anthropic API | Automated testing | Azure AI | Azure AI ServicesCertification opportunities | Collaborative team | Continuous learning | Cross-industry projects | Flexible work arrangementsSenior-level Full TimeBerlin, Germany R1d ago
-
A/B | A/B Testing | API Design | Alerting | AzureEnglish communication overlap Americas and Europe time zones | Flexible independent contractor retainer | Paid time off | Performance-based bonus | Remote workSenior-level Contract Full TimeUkrainka, Kyiv Oblast, Ukraine - Remote R1d ago
-
AI Engineer (Hybrid) USD 130K-180KAPI Development | Backend Development | Cloud Computing | Computer Vision | Deep learningHybrid work environmentSenior-level Full TimeGiza, El Omraniya, Egypt R1d ago
-
Data Engineer PLN 23K-23KAPI Development | AWS | Azure | Best practices | BioinformaticsConference opportunities | Fully remote work | Modern equipment provided | Professional development budgetSenior-level Full TimeWarszawa, Poland R1d ago
-
AI Architect EUR 36K-45KAIOps | AWS | Amazon SageMaker | Artificial Intelligence | AzureCertification support | Hybrid work | Indefinite contract | Jornada intensiva in July and August | Paid trainingSenior-level Full TimeRemote, Spain R1d ago
-
(Senior) AI Engineer (all genders) EUR 65K-75KAgile methods | Cloud Computing | Containerization | DevOps | DockerAdditional vacation | E-learning support | Fitness benefits | Flexible work options | Regular internal eventsMid-level Full TimeBremen, Munich, Mannheim, Mainz, Berlin, Remote R1d ago
-
Chatbot LLM Engineer (all genders) EUR 38K-66KAgile methods | LLM Operations | Language Models | Large Language Models | Prompt engineeringE-learning support | Extra time off | Family-friendly support | Fitness benefits | Flexible working hoursMid-level Full TimeBremen, Munich, Mannheim, Mainz, Berlin, Remote R1d ago
-
Mid-level Full TimeNew Delihi, India; Hybrid (Bengaluru, Karnataka, … R1d ago
-
Senior Data Engineer (Modern Data Platform & AI) (all genders) | Berlin, hybrid or remote EUR 68K-90KAWS | Airflow | Amazon Athena | Amazon S3 | Apache SparkDiscounted BVG ticket | Hybrid work | Jobrad | Mental health support | Remote work optionSenior-level Full TimeGermany - Remote R1d ago
-
Lead AI Engineer INR 2475K-4500KAWS | Angular | Azure | Clojure | Distributed SystemsHybrid setup | Paid time off | Worker insuranceSenior-level Full TimePune, Maharashtra, India R1d ago
-
Data Processing | GRPC | GraphQL | Large Scale Data | Large-scaleDirect product impact | Experimentation | Fast-paced startup culture | Rapid iteration | Remote OKMid-level Full TimeNew York, New York, United States R1d ago
-
Consultant Senior/Tech Lead- Fullstack (Java/Python/Gen AI, React, Angular, Vue) - F/H/N EUR 35K-39KAngular | Code review | Domain-Driven Design | Generative AI | JavaFlexible work hours | Laptop choice | Meal vouchers | Mentorship | Paid time offSenior-level Full TimeParis, IDF, France R1d ago
-
AI Research Engineer USD 216K-332KBlockchain | Cryptography | Deep learning | Distributed Systems | Federated LearningMid-level Full TimeAnywhere R1d ago
-
AI Researcher USD 247K-340KCryptography | Distributed Systems | Distributed Training | Distributed inference | Federated LearningMid-level Full TimeAnywhere R1d ago
-
Principal Data & AI Solutions (m/w/d) GBP 63K-110KArtificial Intelligence | Big Data | Business Development | Client consulting | Cloud ComputingTravel opportunities | Work from home optionExecutive-level Full TimeRemote job R1d ago
-
AWS | Azure | CI/CD | Cloud platform | Data PipelinesDental insurance | Family support benefits | Flexible spending accounts | Flexible time off | Health insuranceSenior-level Full TimeCanada R1d ago