Qwen
/

Qwen3-VL-Reranker-2B

@@ -1,17 +1,22 @@
 ---
-license: apache-2.0
 base_model:
 - Qwen/Qwen3-VL-2B-Instruct
 tags:
 - transformers
 - multimodal rerank
 ---
 # Qwen3-VL-Reranker-2B
 <p align="center">
     <img src="https://model-demo.oss-cn-hangzhou.aliyuncs.com/Qwen3-VL-Reranker.png" width="400"/>
 <p>
 ## Highlights
 The **Qwen3-VL-Embedding** and **Qwen3-VL-Reranker** model series are the latest additions to the Qwen family, built upon the recently open-sourced and powerful Qwen3-VL foundation model. Specifically designed for multimodal information retrieval and cross-modal understanding, this suite accepts diverse inputs including text, images, screenshots, and videos, as well as inputs containing a mixture of these modalities.
@@ -49,7 +54,6 @@ For more details, including benchmark evaluation, hardware requirements, and inf
 > - `Quantization Support` indicates the supported quantization post process for the output embedding.
 > - `MRL Support` indicates whether the embedding model supports custom dimensions for the final embedding.
 > - `Instruction Aware` notes whether the embedding or reranking model supports customizing the input instruction according to different tasks.
-> Our evaluation indicates that, for most downstream tasks, using instructions (instruct) typically yields an improvement of 1% to 5% compared to not using them. Therefore, we recommend that developers create tailored instructions specific to their tasks and scenarios. In multilingual contexts, we also advise users to write their instructions in English, as most instructions utilized during the model training process were originally written in English.
 ## Model Performance
@@ -187,7 +191,8 @@ def main():
     for query_dict in queries:
         query_text = query_dict.get('text', '')
-        print(f"\nQuery: {query_text}")
         scores = []
         for doc_dict in documents:
@@ -210,10 +215,10 @@ For more usage examples, please visit our [GitHub repository](https://github.com
 If you find our work helpful, feel free to give us a cite.
-```
 @article{qwen3vlembedding,
   title={Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking},
-  author={Li, Mingxin and Zhang, Yanzhao and Long, Dingkun and Chen Keqin and Song, Sibo and Bai, Shuai and Yang, Zhibo and Xie, Pengjun and Yang, An and Liu, Dayiheng and Zhou, Jingren and Lin, Junyang},
   journal={arXiv},
   year={2026}
 }

 ---
 base_model:
 - Qwen/Qwen3-VL-2B-Instruct
+license: apache-2.0
+library_name: transformers
+pipeline_tag: text-ranking
 tags:
 - transformers
 - multimodal rerank
 ---
 # Qwen3-VL-Reranker-2B
 <p align="center">
     <img src="https://model-demo.oss-cn-hangzhou.aliyuncs.com/Qwen3-VL-Reranker.png" width="400"/>
 <p>
+This repository contains the **Qwen3-VL-Reranker-2B** model, as presented in the paper [Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking](https://huggingface.co/papers/2601.04720).
 ## Highlights
 The **Qwen3-VL-Embedding** and **Qwen3-VL-Reranker** model series are the latest additions to the Qwen family, built upon the recently open-sourced and powerful Qwen3-VL foundation model. Specifically designed for multimodal information retrieval and cross-modal understanding, this suite accepts diverse inputs including text, images, screenshots, and videos, as well as inputs containing a mixture of these modalities.
 > - `Quantization Support` indicates the supported quantization post process for the output embedding.
 > - `MRL Support` indicates whether the embedding model supports custom dimensions for the final embedding.
 > - `Instruction Aware` notes whether the embedding or reranking model supports customizing the input instruction according to different tasks.
 ## Model Performance
     for query_dict in queries:
         query_text = query_dict.get('text', '')
+        print(f"
+Query: {query_text}")
         scores = []
         for doc_dict in documents:
 If you find our work helpful, feel free to give us a cite.
+```bibtex
 @article{qwen3vlembedding,
   title={Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking},
+  author={Li, Mingxin and Zhang, Yanzhao and Long, Dingkun and Chen, Keqin and Song, Sibo and Bai, Shuai and Yang, Zhibo and Xie, Pengjun and Yang, An and Liu, Dayiheng and Zhou, Jingren and Lin, Junyang},
   journal={arXiv},
   year={2026}
 }