AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Build and monitor inference tests in simulated and production environments
- Create test datasets and simulation scenarios for deployment
- Design model serving architectures for low latency high throughput
- Diagnose serving bottlenecks using performance metrics
- Evaluate model efficiency and iterate on inference algorithms
- Integrate inference frameworks into edge and on device production pipelines
- Optimize batching and reduce network delays
- Optimize memory usage in inference pipelines
Perks/Benefits
Skills/Tech-stack
Compute Shaders | Custom Compute Shaders | Data Pipelines | Diffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Expert parallelism | Flash Attention | GPU Kernels | High Throughput | Inference Optimization | Inference Systems | KV cache | Kernel optimization | Latency optimization | Low Latency | Memory Optimization | Metal Shading Language | Mobile Devices | Model Serving | Performance Benchmarking | Pipeline parallelism | Pruning | Quantization | Shading language | Speculative decoding | Tensor Parallelism | Throughput Optimization | Vision Transformers
Education
Roles
Related jobs
-
Senior Data Engineer PHP 80K-160KApache Airflow | Apache Spark | BigQuery | Cloud platform | Data ModelingAnnual leave | Birthday leave | Flexible work arrangement | Hybrid work arrangement | Learning leaveSenior-level Full TimeTaguig, Metro Manila, Philippines R13h ago
-
Data Engineering Lead Analyst - HOH - Evernorth INR 1500K-2500KAWS Glue | AWS RDS | Amazon Athena | Amazon Redshift | Amazon S3Remote work flexibilitySenior-level Full TimeHIH - Hyderabad, India R13h ago
-
(Senior) AI Engineer (all genders) EUR 65K-75KAgile methods | Cloud Computing | Containerization | DevOps | DockerAdditional vacation | E-learning support | Fitness benefits | Flexible work options | Regular internal eventsMid-level Full TimeBremen, Munich, Mannheim, Mainz, Berlin, Remote R1d ago
-
Senior Data Engineer (Modern Data Platform & AI) (all genders) | Berlin, hybrid or remote EUR 68K-90KAWS | Airflow | Amazon Athena | Amazon S3 | Apache SparkDiscounted BVG ticket | Hybrid work | Jobrad | Mental health support | Remote work optionSenior-level Full TimeGermany - Remote R1d ago
-
AI Research Engineer USD 216K-332KBlockchain | Cryptography | Deep learning | Distributed Systems | Federated LearningMid-level Full TimeAnywhere R1d ago
-
AI Researcher USD 247K-340KCryptography | Distributed Systems | Distributed Training | Distributed inference | Federated LearningMid-level Full TimeAnywhere R1d ago
-
AWS | Azure | CI/CD | Cloud platform | Data PipelinesDental insurance | Family support benefits | Flexible spending accounts | Flexible time off | Health insuranceSenior-level Full TimeCanada R1d ago
-
Data Scientist - Production Engineering USD 140K-175KAWS Glue | Amazon Athena | Amazon ECS | Amazon S3 | Apache Airflow401k match | Annual Company Conference | Childcare support | Continued Education Reimbursements | Flexible time offSenior-level Full TimeRemote (US) R2d ago
-
Machine Learning Engineer, Customer Support Engineering USD 162K-186KAgent Orchestration | Agent systems | Artificial Intelligence | Autonomous Reasoning | Fine TuningSenior-level Full TimeRemote-USA R2d ago
-
Staff Software Engineer, AI Developer Tools USD 180K-245KAPI Design | Agent systems | CI/CD | Compliance | Data PrivacySenior-level Full TimeDenver, CO;San Francisco, CA;New York, NY;Seattle, … R2d ago
-
Senior Data Engineer USD 60K-180K800-53 | ABAC | AWS Glue | AWS IAM | AWS S3Public Trust clearance supported | Remote work | Training support | Weekly office hours supportSenior-level Full TimeRemote - Public Trust clearance required R2d ago
-
Staff Software Engineer, Big Data Storage USD 177K-364KApache Flink | Apache Hive | Apache Iceberg | Apache Spark | Column BackfillSenior-level Full TimePalo Alto, CA, US; Remote, US R2d ago
-
Senior Software Engineer, Data Authoring Platform USD 196K-230KAPI Design | Anomaly Detection | Automated testing | DSL | Data GovernanceEmployee travel credits | Remote eligibleSenior-level Full TimeRemote USA R2d ago
-
Lead AI Engineer, Business Operations (Hybrid or Remote USD 150K-220KAPI Design | Backend Development | Cloud Platforms | Evaluation Frameworks | Fine Tuning401k company match | Career advancement opportunities | Dental insurance | Flexible time off policy | Life insuranceSenior-level Full TimeDallas, Texas, United States; United States R2d ago
-
Apache Spark | Data Pipelines | Data Processing | ETL | PythonFlexible work setup | Hybrid work environmentMid-level Full TimeNew York, New York; Hybrid; London, … R2d ago
-
ML / Data Analyst BGN 44K-52KData Pipelines | Data Quality | Machine Learning | Regex | SQLAdditional paid leave | Employee assistance program | Flexible working hours | Food vouchers | Hybrid work modelMid-level Full TimeSofia, 23, Bulgaria R3d ago
-
AWS | Ablation Studies | CI/CD | CUDA | DDP25 days annual leave | Career growth opportunities | Fully remote flexibility | High-ownership environment | Hybrid office accessSenior-level Full TimeNetherlands R3d ago
-
AWS | Ablation Studies | CI/CD | CUDA | DDPAnnual leave | Career growth opportunities | High-ownership environment | Hybrid office access | Public holidaysSenior-level Full TimeIreland R3d ago
-
AWS | CI/CD | CUDA | DDP | Deep learningAnnual leave | Career growth opportunities | Hybrid work option | Public holidays | Remote work optionSenior-level Full TimeSwitzerland R3d ago
-
AWS | Ablation Studies | CI/CD | CUDA | DDPAnnual leave | Career growth opportunities | Hybrid office option | Planning Sessions | Public holidaysSenior-level Full TimeFrance R3d ago
-
AWS | CI/CD | CUDA | DDP | DeepSpeedAnnual paid leave | Career growth opportunities | High-ownership environment | Hybrid office access | Public holidaysSenior-level Full TimeGermany R3d ago
-
AWS | Ablation Studies | CI/CD | CUDA | DDPAnnual leave | Career growth | Hybrid office option | Remote work | Team eventsSenior-level Full TimeSpain R3d ago
-
BI Engineer (Hybrid or Onsite) EUR 42K-54KAWS | Azure | CI/CD | CouchDB | DAXCareer development | Employee wellness program | Private health insurance | Provision of tools and equipmentMid-level Full TimeMarousi, Attica, Greece R3d ago
-
AWS Lambda | BentoML | Cost Optimization | Docker | EvaluationCo-working budget | Equity or stock options | Fully remote | Home office setup budget | Paid time offSenior-level Full TimeBrazil R3d ago
-
Inference Engineer USD 180K-250KCUDA | Continuous batching | Distributed Systems | Generative Models | Machine Learning401k | Commuter allowance | Dental insurance | Flexible PTO | Health insuranceMid-level Full Time*HQ - San Francisco, CA R3d ago