AI Research Engineer (Kernel & Inference Optimization)
Tasks
- Build and monitor inference tests
- Create test datasets and simulation scenarios
- Deploy inference pipelines
- Design model serving architectures
- Identify and resolve serving bottlenecks
- Integrate inference frameworks into production pipelines
- Optimize inference strategies
- Track performance metrics
Skills/Tech-stack
Computer Vision | Diffusion Models | Edge Computing | Expert parallelism | Flash Attention | GPU Kernels | Inference Optimization | KV cache | Low-level optimization | Machine Learning | Memory Management | Mobile optimization | Model Serving | NLP | Neural Networks | On-device Inference | Pipeline parallelism | Pruning | Quantization | Speculative decoding | Tensor Parallelism | Vision Transformers