Add model card
#2
by
nielsr HF Staff - opened
README.md
ADDED
|
@@ -0,0 +1,6 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
pipeline_tag: video-text-to-text
|
| 3 |
+
---
|
| 4 |
+
This repository contains the model of the paper [VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction](https://huggingface.co/papers/2501.01957).
|
| 5 |
+
|
| 6 |
+
Code: https://github.com/VITA-MLLM/VITA
|