Multi-Images Multi-Audio Multi-turn Multi-Modal bilingual TinyLlama

SigClip Encoder + Whisper Encoder + TinyLlama, source code at https://github.com/mesolitica/multimodal-LLM

Downloads last month
-
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support