AI Inference Engineer QVAC
Tasks
- Collaborate with research teams for production deployments
- Define and maintain inference abstractions
- Design inference layer for edge devices
- Enhance inference engine performance and compatibility
- Evaluate and adopt new technologies
- Improve runtime efficiency: memory, latency, and throughput
- Integrate AI features into existing products
- Optimize C++ inference systems
Perks/Benefits
- Collaboration with top talent
- Fully remote
- Globally distributed work environment
- High ownership
- Innovation-focused environment
Skills/Tech-stack
C++ | Diffusion Models | Edge Computing | ggml | JavaScript | Language Models | Large Language Models | Latency Optimization | llama.cpp | Machine Learning | Machine Learning Inference | Memory Optimization | ONNX | Throughput Optimization | Transformers
Education
Bachelor of Engineering | Bachelor of Science | Master of Science