AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Build and monitor inference tests
- Collaborate with cross-functional teams
- Design model serving architectures
- Develop inference algorithms
- Diagnose serving bottlenecks
- Establish performance metrics
- Integrate serving frameworks into production
- Optimize batch processing
- Optimize inference pipelines
- Optimize memory usage
- Prepare test datasets and simulation scenarios
Perks/Benefits
Skills/Tech-stack
Compute Shaders | Diffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Embedded inference | Expert parallelism | Flash Attention | GPU Kernels | High Throughput | Inference Optimization | Inference Systems | KV cache | Low Latency | Machine Learning | Memory Optimization | Metal Shading Language | Mobile optimization | Model Serving | NLP | Pipeline parallelism | Pruning | Python | Quantization | Shading language | Speculative decoding | Tensor Parallelism | Tensor Processing Units | Tensor processing | Vision Transformers
Education
Roles
Related jobs
-
Featured Feat. AI Engineer (MTS) USD 160K-300KAPI Development | AWS | Amazon Web Services | Deep learning | FastAPIMentoring | Open source contributions | Remote workMid-levelRemote R17d ago
-
Featured Feat. Data Engineer USD 80K-150KData Monitoring | Data Quality | Data Validation | ELT | ETLRemote workEntry-levelRemote R17d ago
-
Solutions Architect USD 165K-216KADLS | AWS | Airflow | Azure | CassandraAdvanced training | Certification support | Collaborative culture | Generous paid time off | Professional development opportunitiesSenior-level Full TimeLATAM - Remote R3h ago
-
Head of AI - JT AI Labs (M/W/D) EUR 85K-100KData Annotation | Data Pipelines | Data Quality | Data Security | Deep learningAnnual learning budget | Family care policy | Flexible work hours | Free yoga lessons | Health insuranceExecutive-level Full TimeParis, IDF, France R3h ago
-
Software Engineer, Data Infrastructure PLN 300K-347KAWS | Apache Spark | Azure | Data Ingestion | Data LakeCareer growth budget | Dental coverage | Family forming support | Fertility healthcare support | Group life insuranceSenior-level Full TimeWarsaw R6h ago
-
Senior-level Full TimeVitrolles, Provence-Alpes-Côte d'Azur, France R6h ago
-
Applied AI Engineer, Agentic Systems USD 115K-192K.NET | APIs | Anthropic | CrewAI | Evaluation FrameworksAI and productivity tools access | Remote work accessSenior-level Full TimeRemote - United States R9h ago
-
Senior Industrial Engineer, Process Optimization USD 100K-120K5S | AutoCAD | Cause analysis | Cost modeling | Excel401k | Dental insurance | Disability insurance | Flexible spending account | Health savings accountSenior-level Full TimeBethlehem, PA, United States R14h ago
-
Senior-level Full TimeJakarta, ID R16h ago
-
Data Engineer - EY GDS Spain - Hybrid EUR 65K-65KAWS | Apache Airflow | Apache Parquet | Avro | AzureContinuous learning programs | Flexible work-life integration | Hybrid work model | Volunteering opportunities | Well-being programsMid-level Full TimeMalaga, ES, 29590 R16h ago
-
Senior AI Engineer BGN 90K-105KAPI Design | AWS | Access Control | Amazon Bedrock | Amazon OpenSearchFully distributed remote | Paid holiday | Professional development | State-of-the-art hardware | Training & CertificationsSenior-level Full TimeSofia, Sofia City Province, Bulgaria - … R16h ago
-
Automation and AI Solutions Lead INR 2500K-3500KAWS Bedrock | Agent systems | Amazon AgentCore | Async Programming | CI/CDFlexible vacation | Headspace access | Hybrid work | Mental health days | Retirement savingsSenior-level Full TimeIndia, Bengaluru, Karnataka R16h ago
-
Anomaly Detection | CI/CD | Clustering | DBT | Data ModelingFlexible schedule | Remote work | Work from homeSenior-level Full TimeMetro Manila, Philippines - Remote R16h ago
-
AWS | Airflow | Azure | CI/CD | DagsterFlexible US business hours | Remote workMid-level Full TimePakistan - Remote R16h ago
-
APIs | Agile | Analytics | Automation | Cost analysisComprehensive healthcare | Flexible time off | Retirement plan | Support for working parents | Tuition reimbursementMid-level Full TimeHA3-Gurugram - DLF Cyber City, India R16h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KDiffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash AttentionEnglish support | Remote workSenior-level Full TimeRemote job R19h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KComputer Vision | Deep learning | Diffusion Models | Distributed inference | Edge ComputingRemote workSenior-level Full TimeRemote job R19h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KCompute Shaders | Diffusion Models | Distributed Inference Systems | Distributed inference | Edge ComputingRemote workSenior-level Full TimeRemote job R19h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KDiffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Expert parallelismCareer growth | Collaborative research environment | English communication support | Remote work opportunitySenior-level Full TimeRemote job R19h ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KCompute Shaders | Custom Compute Shaders | Data Pipelines | Diffusion Models | Distributed Inference SystemsRemote workSenior-level Full TimeRemote job R19h ago
-
Senior Data Engineer I (Postgres DBA) USD 123K-184KAWS | Amazon RDS | Apache Airflow | CI/CD | CloudWatch401k matching | Doordash DashPass | Equity grants eligibility | Flexible remote environment | Lifestyle benefits platformSenior-level Full TimeRemote R1d ago
-
AWS | Airbyte | Amazon Redshift | Apache Airflow | CI/CDHybrid work | On-call rotation | Remote-first workMid-level Full TimeCape Town, Western Cape, South Africa R1d ago
-
AWS | Agile | Azure | CI/CD | DevOpsEnglish classes | Free office food and drinks | Internal training | Paid certifications | Paid time offSenior-level Full TimeZaragoza, Spain R1d ago
-
Apigee | Artificial Intelligence | Bash | BigQuery | Cloud HostingEnglish classes | Free Office Meals and Drinks | Free parking | Paid certifications | Paid vacationMid-level Full TimeMadrid, Spain R1d ago
-
API Gateway | AWS | Authentication | Bash | CI/CDAccess to public holidays | English classes | Learning platform access | Professional development | Referral programSenior-level Full TimeCórdoba, Córdoba Province, Argentina R1d ago