AI Inference Engineer QVAC
Tasks
- Adapt inference engines for performance and compatibility
- Collaborate to transition models to production
- Define inference abstractions for scalable maintainable deployment
- Design inference layer for edge devices
- Evaluate and implement new technologies
- Improve memory usage latency throughput
- Integrate AI features into products
- Maintain robust efficient scalable systems
- Optimize inference runtime performance
Perks/Benefits
- Collaboration with top talent
- Fully remote
- Globally distributed team
- High ownership
- Innovation and experimentation
Skills/Tech-stack
C++ | Deep learning | Diffusion Models | Edge Computing | Ggml | JavaScript | Language Models | Large Language Models | Llama.cpp | ONNX | Transformers
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Roles
Related jobs
-
API Integration | Airtable | Anthropic API | CRM | ChatGPTFlexible scheduling | Remote workEntry-level Full TimeKenya - Remote R1d ago
-
AI & Cloud Engineering USD 20K-20KAPI Gateway | AWS Bedrock | AWS CDK | AWS CloudFormation | AWS Lambda100 percent remote | Full-timeMid-level Full TimeKenya - Remote R14d ago