Update README.md
Browse files
README.md
CHANGED
|
@@ -10,6 +10,8 @@ pipeline_tag: visual-document-retrieval
|
|
| 10 |
|
| 11 |
# ColQwen3: Visual Retriever based on Qwen3-VL-2B-Instruct with ColBERT strategy
|
| 12 |
|
|
|
|
|
|
|
| 13 |
ColQwen is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features.
|
| 14 |
It is a [Qwen3-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-2B-Instruct) extension that generates [ColBERT](https://arxiv.org/abs/2004.12832)- style multi-vector representations of text and images.
|
| 15 |
It was introduced in the paper [ColPali: Efficient Document Retrieval with Vision Language Models](https://arxiv.org/abs/2407.01449) and first released in [this repository](https://github.com/ManuelFay/colpali)
|
|
|
|
| 10 |
|
| 11 |
# ColQwen3: Visual Retriever based on Qwen3-VL-2B-Instruct with ColBERT strategy
|
| 12 |
|
| 13 |
+
### source code: [Mungeryang/colqwen3](https://github.com/Mungeryang/colqwen3)
|
| 14 |
+
|
| 15 |
ColQwen is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features.
|
| 16 |
It is a [Qwen3-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-2B-Instruct) extension that generates [ColBERT](https://arxiv.org/abs/2004.12832)- style multi-vector representations of text and images.
|
| 17 |
It was introduced in the paper [ColPali: Efficient Document Retrieval with Vision Language Models](https://arxiv.org/abs/2407.01449) and first released in [this repository](https://github.com/ManuelFay/colpali)
|