AI Inference Engineer QVAC
Tasks
- Adapt inference engines for performance and compatibility
- Define core inference abstractions
- Design inference layer for edge devices
- Develop C++ inference systems
- Ensure long session stability
- Evaluate and implement new technologies
- Improve memory usage, latency, and throughput
- Integrate AI features into existing products
- Optimize inference runtime performance
- Transition models to production deployments
Perks/Benefits
- Fully remote
- Global distributed team
- High ownership
- Innovation-focused environment
- Top talent collaboration
Skills/Tech-stack
C++ | Deep learning | Edge AI | ggml | JavaScript | Language Models | Large Language Models | Latency optimization | llama.cpp | Machine Learning | Memory Optimization | ONNX | Performance optimization | Throughput Optimization | Transformers
Education
N/A