AI Inference Engineer QVAC
Tasks
- Adapt inference engines for compatibility
- Collaborate with research teams for production deployments
- Define core inference abstractions
- Design inference layer for edge devices
- Develop C++ inference systems
- Enhance throughput and long session stability
- Evaluate and implement new optimization technologies
- Improve memory usage and latency
- Integrate AI features into products
- Optimize inference performance and reliability
Perks/Benefits
- Collaborative environment
- Competitive compensation
- Fast-paced innovation
- Fully remote
- Global distributed work environment
- High ownership
Skills/Tech-stack
C++ | Diffusion Models | Edge Computing | ggml | JavaScript | Language Models | Large Language Models | llama.cpp | Machine Learning | ONNX | Transformers