VITA-MLLM
/

VITA-1.5

Video-Text-to-Text

Model card Files Files and versions

Add model card

#2

by nielsr HF Staff - opened Jan 7, 2025

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

Files changed (1) hide show

README.md +6 -0

README.md ADDED Viewed

	@@ -0,0 +1,6 @@

+---
+pipeline_tag: video-text-to-text
+---
+This repository contains the model of the paper [VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction](https://huggingface.co/papers/2501.01957).
+Code: https://github.com/VITA-MLLM/VITA