Alexislhb commited on
Commit
1517266
·
verified ·
1 Parent(s): 3e703ac

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -6,13 +6,12 @@ tags:
6
  - vision-language
7
  - cinematography
8
  - shotbench
9
- - arxiv:2506.21356
10
  ---
11
 
12
  ## Model description
13
 
14
  This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct), trained by supervised fine-tuning and GRPO on the largest and high-quality dataset for cinematic language understanding to date. It currently achieves state-of-the-art performance on [ShotBench](https://vchitect.github.io/ShotBench-project/), a comprehensive benchmark for evaluating cinematography understanding in vision-language models.
15
-
16
  ### Demo Code
17
 
18
  **Image**
 
6
  - vision-language
7
  - cinematography
8
  - shotbench
 
9
  ---
10
 
11
  ## Model description
12
 
13
  This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct), trained by supervised fine-tuning and GRPO on the largest and high-quality dataset for cinematic language understanding to date. It currently achieves state-of-the-art performance on [ShotBench](https://vchitect.github.io/ShotBench-project/), a comprehensive benchmark for evaluating cinematography understanding in vision-language models.
14
+ Please visit our [paper](https://arxiv.org/abs/2506.21356) for more details.
15
  ### Demo Code
16
 
17
  **Image**