Multimodal AI Systems Architect (AI Engineering)
Tasks
- Architect multimodal RAG systems for videos and PDFs
- Integrate audio native models into agent reasoning loops
- Integrate vision encoders into agent reasoning loops
- Optimize streaming latency for voice to voice AI interactions
Perks/Benefits
- N/A
Skills/Tech-stack
CLIP | Cross-modal alignment | Multimodal LLM | Streaming Architecture | WebRTC | Whisper
Education
N/A
Roles
Related jobs
- No jobs found.