Multimodal AI Systems Architect (AI Engineering)
Tasks
- Architect multimodal RAG systems for videos and PDFs
- Integrate audio native models into agent reasoning loops
- Integrate vision encoders into agent reasoning loops
- Optimize streaming latency for voice to voice AI interactions
Perks/Benefits
- N/A
Skills/Tech-stack
CLIP | Cross-modal alignment | Multimodal LLM | Streaming Architecture | WebRTC | Whisper
Education
N/A
Roles
Related jobs
-
API Gateway | AWS | AWS Lambda | AWS Step Functions | Agent communicationFlexible work arrangements | Mentorship | Work-life balanceSenior-level Full TimeSydney, New South Wales, AUS6d ago