Update README.md
Browse files
README.md
CHANGED
|
@@ -12,15 +12,7 @@ pipeline_tag: image-text-to-text
|
|
| 12 |
library_name: transformers
|
| 13 |
---
|
| 14 |
|
| 15 |
-
|
| 16 |
-
This repository contains the model described in [Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources](https://huggingface.co/papers/2504.00595).
|
| 17 |
-
|
| 18 |
-
Project page: https://victorwz.github.io/Open-Qwen2VL
|
| 19 |
-
|
| 20 |
-
For code and usage instructions, please refer to the official codebase: https://github.com/Victorwz/Open-Qwen2VL
|
| 21 |
-
|
| 22 |
-
|
| 23 |
-
# Model Card for Open-Qwen2VL
|
| 24 |
|
| 25 |
Open-Qwen2VL-base is a pre-trained base multimodal model that takes images and text as input and produces text as output. This model is described in the paper [Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources](https://huggingface.co/papers/2504.00595). The code is available at [https://github.com/Victorwz/Open-Qwen2VL](https://github.com/Victorwz/Open-Qwen2VL).
|
| 26 |
|
|
|
|
| 12 |
library_name: transformers
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# Model Card for Open-Qwen2VL-base
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 16 |
|
| 17 |
Open-Qwen2VL-base is a pre-trained base multimodal model that takes images and text as input and produces text as output. This model is described in the paper [Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources](https://huggingface.co/papers/2504.00595). The code is available at [https://github.com/Victorwz/Open-Qwen2VL](https://github.com/Victorwz/Open-Qwen2VL).
|
| 18 |
|