Zhang199 commited on
Commit
28a1978
·
verified ·
1 Parent(s): 30153bd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -7,4 +7,9 @@ pipeline_tag: image-text-to-text
7
 
8
  [![arXiv](https://img.shields.io/badge/Arxiv-2402.14289-b31b1b.svg?logo=arXiv)](https://github.com/ZhangXJ199/TinyLLaVA-Video-R1)[![Github](https://img.shields.io/badge/Github-Github-blue.svg)](https://github.com/ZhangXJ199/TinyLLaVA-Video-R1)
9
 
 
10
 
 
 
 
 
 
7
 
8
  [![arXiv](https://img.shields.io/badge/Arxiv-2402.14289-b31b1b.svg?logo=arXiv)](https://github.com/ZhangXJ199/TinyLLaVA-Video-R1)[![Github](https://img.shields.io/badge/Github-Github-blue.svg)](https://github.com/ZhangXJ199/TinyLLaVA-Video-R1)
9
 
10
+ Here, we introduce a small-scale video reasoning model TinyLLaVA-Video-R1, based on the traceably trained model [TinyLLaVA-Video](https://github.com/ZhangXJ199/TinyLLaVA-Video). After reinforcement learning on general Video-QA datasets, the model not only significantly improves its reasoning and thinking abilities, but also exhibits the emergent characteristic of “aha moments”.
11
 
12
+ ### Result
13
+ | Model (HF Path) | Video-MME | MVBench | MLVU | MMVU |
14
+ | :----------------------------------------: | ------------- | ------- | -------------- | ---------- |
15
+ | [Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512](https://huggingface.co/Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512) | 46.6 | 49.5 | 52.4 | 46.9 |