Multi-Images Multi-Audio Multi-turn Multi-Modal bilingual TinyLlama
SigClip Encoder + Whisper Encoder + TinyLlama, source code at https://github.com/mesolitica/multimodal-LLM
- Downloads last month
- -
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support