Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_vl
conversational
text-generation-inference

Add library_name and pipeline_tag metadata

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +9 -6
README.md CHANGED
@@ -1,18 +1,21 @@
1
  ---
2
- license: apache-2.0
 
3
  datasets:
4
  - inclusionAI/ZoomBench
5
  - inclusionAI/ZwZ-RL-VQA
6
  language:
7
  - en
8
- base_model:
9
- - Qwen/Qwen2.5-VL-7B-Instruct
 
10
  ---
 
11
  # ZwZ-7B
12
 
13
  <div align="center">
14
 
15
- πŸ“ƒ [Paper](https://arxiv.org/pdf/2602.11858) | 🏠 [Project](https://github.com/inclusionAI/Zooming-without-Zooming) | πŸ€— [Collection](https://huggingface.co/collections/inclusionAI/zooming-without-zooming)
16
 
17
  </div>
18
 
@@ -37,7 +40,7 @@ Traditional "Thinking-with-Images" methods zoom into regions of interest during
37
  ### Installation
38
 
39
  ```bash
40
- pip install transformers accelerate torch
41
  ```
42
 
43
  ### Inference
@@ -105,7 +108,7 @@ We introduce [ZoomBench](https://huggingface.co/datasets/inclusionAI/ZoomBench),
105
 
106
  ### Benchmark Results
107
 
108
- ZwZ-7B achieves state-of-the-art performance among open-source models on fine-grained perception benchmarks. Please refer to the [paper](https://arxiv.org/pdf/YOUR_ARXIV_ID) for detailed results.
109
 
110
  ## Limitations
111
 
 
1
  ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-VL-7B-Instruct
4
  datasets:
5
  - inclusionAI/ZoomBench
6
  - inclusionAI/ZwZ-RL-VQA
7
  language:
8
  - en
9
+ license: apache-2.0
10
+ library_name: transformers
11
+ pipeline_tag: image-text-to-text
12
  ---
13
+
14
  # ZwZ-7B
15
 
16
  <div align="center">
17
 
18
+ πŸ“ƒ [Paper](https://huggingface.co/papers/2602.11858) | 🏠 [Project](https://github.com/inclusionAI/Zooming-without-Zooming) | πŸ€— [Collection](https://huggingface.co/collections/inclusionAI/zooming-without-zooming)
19
 
20
  </div>
21
 
 
40
  ### Installation
41
 
42
  ```bash
43
+ pip install transformers accelerate torch qwen-vl-utils
44
  ```
45
 
46
  ### Inference
 
108
 
109
  ### Benchmark Results
110
 
111
+ ZwZ-7B achieves state-of-the-art performance among open-source models on fine-grained perception benchmarks. Please refer to the [paper](https://huggingface.co/papers/2602.11858) for detailed results.
112
 
113
  ## Limitations
114