AI Inference Engineer (QVAC)
Tasks
- Collaborate to deploy models into production
- Define inference abstractions for scalability
- Design inference layer for edge devices
- Enhance inference engines for performance
- Evaluate and implement new technologies
- Improve runtime efficiency: memory, latency, throughput, and stability
- Integrate AI features into existing products
- Optimize C++ inference systems
Perks/Benefits
- Collaboration with top talent
- Fast-paced innovation
- Fully remote
- Global distributed team
- High ownership
- Opportunity to work on cutting-edge AI
Skills/Tech-stack
C++ | Diffusion Models | Edge Computing | ggml | JavaScript | Language Models | Large Language Models | llama.cpp | Low Latency | Machine Learning | Memory Optimization | ONNX | Performance Optimization | Throughput Optimization | Transformers
Education
Bachelor of Engineering | Bachelor of Science | Master of Science