AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Build and monitor inference tests
- Collaborate with cross-functional teams
- Design model serving architectures
- Develop inference algorithms
- Diagnose serving bottlenecks
- Establish performance metrics
- Integrate serving frameworks into production
- Optimize batch processing
- Optimize inference pipelines
- Optimize memory usage
- Prepare test datasets and simulation scenarios
Perks/Benefits
Skills/Tech-stack
Compute Shaders | Diffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Embedded inference | Expert parallelism | Flash Attention | GPU Kernels | High Throughput | Inference Optimization | Inference Systems | KV cache | Low Latency | Machine Learning | Memory Optimization | Metal Shading Language | Mobile optimization | Model Serving | NLP | Pipeline parallelism | Pruning | Python | Quantization | Shading language | Speculative decoding | Tensor Parallelism | Tensor Processing Units | Tensor processing | Vision Transformers
Education
Roles
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R3d ago
-
Featured Feat. Senior Software Engineer USD 80K-170KAlgorithms | C++ | Data Structures | Go | JavaContractor flexibility | Remote workSenior-level ContractRemote R10d ago
-
AWS Bedrock | Agent systems | Anthropic API | Autogen | Azure401k matching program | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Contract Full TimeRemote, OR, United States R10h ago
-
Data Engineer USD 74K-133KAgile | Apache Airflow | BigQuery | Cloud Composer | Cloud Data401k retirement plan | Dental insurance | Disability insurance | Flexible time off | Health insuranceMid-level Full TimeLisle, IL, United States R12h ago
-
AWS | Airflow | DBT | Fine Tuning | Language ModelsBonuses | Disability insurance | Life insurance | Paid parental leave | Paid time offSenior-level Full TimeRemote, India R14h ago
-
API Testing | Cypher | Data Quality | DataOps | DevOpsBenefits | Competitive pay | Growth opportunity | Remote work | Travel requiredSenior-level Full TimeReston, VA, United States R14h ago
-
Principal Engineer - Data Platform USD 221K-387KAWS | Airflow | Apache Hive | Apache Iceberg | Apache ImpalaRemote workSenior-level Full TimeSanta Clara, California, United States R14h ago
-
AI Engineer H/F - CDI EUR 50K-65KAI Agents | Agent systems | Cloud Computing | Deep learning | Fine TuningCooptation bonus | Equipment bonus | Flexible remote work | Health insurance | Meal vouchersMid-level Full TimeParis, IDF, France R16h ago
-
AWS | Azure | Data Governance | Data Marts | Data ModelingCSE | Career development opportunities | Cooptation bonus | Diversity initiatives | Employee representative councilSenior-level Full TimeAix-en-Provence, Provence-Alpes-Côte d'Azur, France R16h ago
-
Data Engineer Sr. BRL 235K-396KAmazon Redshift | Data Modeling | Data Pipelines | Data Storytelling | Data VisualizationRemote work | Work-life balance flexibilitySenior-level Full TimeRemote, Brazil R17h ago
-
AI Integration Specialist (HubSpot & Actionstep Automation) AU Dayshift, Remote Philippines AUD 28K-30KAI Agents | API Integration | Actionstep | CRM Workflow | CRM Workflow AutomationFlexible working hours PT | Remote work | WFHMid-level Full TimeRemote R17h ago
-
Senior Database Engineer - #3190 (Remote) INR 1500K-2700KAWS IAM | AWS KMS | AWS RDS | Amazon Aurora | BashOn-call rotation | Remote workSenior-level Full TimeChennai, India R18h ago
-
A/B | A/B Testing | AI Search | Azure | Azure AIFlexible working hours | Modern office | Multisport card | Private medical care | Unlimited access to Microsoft technologyMid-level ContractRemote R18h ago
-
Software Engineer SRE (Site Reliability Engineer) INR 1600K-3000KAWS | Ansible | Apache Airflow | Apache Kafka | Apache SparkSenior-level Full TimeBangalore, India Office (BANGALORE) R18h ago
-
API Development | AWS | Amazon SageMaker | CI/CD | Cloud platformAnnual leave | Employee referral program | HMO coverage | Night differential pay | Remote workSenior-level Full TimeRemote R18h ago
-
Anthropic API | AutoGluon | CUDA | CatBoost | Cloud platformRemote workMid-level Full TimeRemote R19h ago
-
GCP BigQuery,DevOps,Credit Card Domain/Associate Director, Data and Analytics Specialist INR 650K-900KApache Airflow | Apache Kafka | BigQuery | CI/CD | Cloud ComposerMid-level Full TimeHyderabad, Telangana, India R20h ago
-
AWS | CI/CD | Dataiku | GitLab CI | HDFSEmployee representative council | Health insurance | Meal vouchers | Profit sharing | Referral bonusSenior-level Full TimeVilleneuve-d'Ascq, Hauts-de-France, France R23h ago
-
Data Scientist confirmé / AI Engineer EUR 50K-55KAzure | CI/CD | Docker | Docker Compose | GCPHealth insurance | Telework | Ticket restaurant | Works CouncilMid-level Full TimeCourbevoie, IDF, France R23h ago
-
Python, GenAI, LLM/Consultant Specialist INR 3700K-5000KAccess Management | Active Directory | Active Directory Domain Services | Agent Frameworks | Amazon Web ServicesFlexible working | Professional developmentMid-level Full TimePune, Maharashtra, India R1d ago
-
AWS Glue | AWS Lambda | AWS S3 | Access Control | Data GovernanceCareer growth opportunities | Collaborative and inclusive work environment | Diverse and inclusive culture | Flexible work arrangements | Permanent remote working modelSenior-level Full TimeCanada R1d ago
-
AI Agents | API Integration | Backend Development | Cloud Platforms | ContainerizationCoworking space access | Engineering autonomy | Healthcare coverage | Home-office equipment provided | Remote workMid-level Full TimeSpain R1d ago
-
AI | AI Agents | Backend Development | Cloud Platforms | ContainersCoworking spaces | Flexible work location | Fully remote | Healthcare coverage | Home-office equipmentMid-level Full TimeGermany R1d ago
-
Lead GIS Data Engineer PLN 206K-282KArcGIS | ArcGIS Enterprise | ArcGIS Network Analyst | ArcGIS Pro | ArcpyBonus | Flexible working hours | Life insurance | Medical coverage | Paid time offSenior-level Full TimePoland Home Office, Poland R1d ago
-
Associate Engineer-Data Analyst(Hybrid) INR 700K-900KAerospace Engineering | Data Analysis | Data Visualization | Mechanical Engineering | Power BICar lease programme | Contingency leave | Employee scholar programme | Group health insurance | Group personal accident insuranceMid-level Full TimeIN-KA-BENGALURU-NORTHGATE ~ Sy No 2/2 Venkatala … R1d ago