AI Inference Engineer QVAC
Tasks
- Adapt inference engines for performance and compatibility
- Define core inference abstractions
- Design inference layer for edge devices
- Develop C plus plus inference systems
- Evaluate and implement new inference technologies
- Integrate AI features into existing products
- Optimize inference runtime efficiency
- Transition models from research to production
Perks/Benefits
- Collaboration with top talent
- Fully remote
- Global distributed team
- High ownership
- Innovation-focused environment
Skills/Tech-stack
C plus plus | Deep learning | Diffusion Models | Edge Computing | Ggml | JavaScript | Language Models | Large Language Models | Latency optimization | Llama CPP | Machine Learning | Memory Optimization | Model Deployment | ONNX | Throughput Optimization | Transformers
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Roles
Related jobs
-
API Gateway | AWS | AWS Lambda | Amazon Aurora | Amazon DynamoDBAnnual performance bonus | Bereavement leave | Dental insurance | Education reimbursement | Family bondingSenior-level Full TimeSantiago, SANTIAGO, Chile R21d ago
-
Cloud Platforms | Data Exploration | Docker | ML algorithms | Model DeploymentEmployee referral bonus | Fully remote | Health benefits | Language courses discount | Paid vacationMid-level Full TimeChile R1mo ago
-
AI Evaluation | Data Analysis | Machine Learning | NLP | PythonAnnual bonus | Global retreats | Home office stipend | Learning stipend | Medical coverageSenior-level Full TimeChile R1mo ago
-
A/B | A/B Testing | B testing | Data Engineering | Machine LearningAnnual bonus | Global retreats | Home office stipend | Learning stipend | Medical coverageSenior-level Full TimeChile R1mo ago