AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Analyze computational efficiency and diagnose bottlenecks
- Apply empirical research to improve inference performance
- Build and monitor inference tests in simulated and production environments
- Create test datasets and simulation scenarios
- Design model serving architectures
- Establish performance metrics and benchmarks
- Integrate inference frameworks into edge and on-device pipelines
- Optimize inference latency, throughput, and memory usage
Perks/Benefits
- Career growth
- Collaborative research environment
- English communication support
- Remote work opportunity
Skills/Tech-stack
Diffusion Models | Distributed Inference Systems | Distributed Inference | Edge Computing | Expert Parallelism | Flash Attention | GPU Kernel Development | High Throughput | Inference Optimization | Inference Systems | KV Cache | Kernel Development | Low Latency | Machine Learning | Machine Learning Models | Memory Optimization | Model Serving | Model Architectures | NLP | Parallel Computing | Performance Benchmarking | Pipeline Parallelism | Pruning | Quantization | Speculative Decoding | Tensor Parallelism | Vision Transformers
Education