Committed by WaltonFuture · Commit 4505455 (verified) · 1 Parent(s): 921d818

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -119,11 +119,11 @@ ZwZ-7B achieves state-of-the-art performance among open-source models on fine-gr
  ## Citation
 
  ```bibtex
- @article{wei2025zooming,
+ @article{wei2026zooming,
  title={Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception},
  author={Wei, Lai and He, Liangbo and Lan, Jun and Dong, Lingzhong and Cai, Yutong and Li, Siyuan and Zhu, Huijia and Wang, Weiqiang and Kong, Linghe and Wang, Yue and Zhang, Zhuosheng and Huang, Weiran},
  journal={arXiv preprint arXiv:2602.11858},
- year={2025}
+ year={2026}
  }
  ```
 