WaltonFuture committed · Commit 87b8d51 (verified) · 1 Parent(s): 81cc970

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -2,7 +2,7 @@
 
  <div align="center">
 
- 📃 [Paper](https://arxiv.org/pdf/YOUR_ARXIV_ID) | 🏠 [Project](https://github.com/inclusionAI/Zooming-without-Zooming) | 🤗 [Collection](https://huggingface.co/collections/inclusionAI/zooming-without-zooming)
+ 📃 [Paper](https://arxiv.org/pdf/2602.11858) | 🏠 [Project](https://github.com/inclusionAI/Zooming-without-Zooming) | 🤗 [Collection](https://huggingface.co/collections/inclusionAI/zooming-without-zooming)
 
  </div>
 
@@ -109,7 +109,7 @@ ZwZ-7B achieves state-of-the-art performance among open-source models on fine-gr
  @article{wei2025zooming,
  title={Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception},
  author={Wei, Lai and He, Liangbo and Lan, Jun and Dong, Lingzhong and Cai, Yutong and Li, Siyuan and Zhu, Huijia and Wang, Weiqiang and Kong, Linghe and Wang, Yue and Zhang, Zhuosheng and Huang, Weiran},
- journal={arXiv preprint arXiv:2602.XXXXX},
+ journal={arXiv preprint arXiv:2602.11858},
  year={2025}
  }
  ```