Upload folder using huggingface_hub
Browse files
README.md
CHANGED
|
@@ -81,9 +81,6 @@ We conducted a comprehensive evaluation of **MOSS-VL-Instruct-0408** across four
|
|
| 81 |
* **๐ง Robust Multimodal Reasoning**: MOSS-VL demonstrates solid logical inference, staying highly competitive with the latest Qwen series on challenging reasoning suites such as `CVBench` and `VisuLogic`.
|
| 82 |
* **๐ Reliable Document Understanding**: While the model is primarily optimized for general perception and video, MOSS-VL still delivers **83.9** on OCR and document analysis, ensuring dependable extraction of text and structured information.
|
| 83 |
|
| 84 |
-
<p align="center">
|
| 85 |
-
<img src="assets/benchmark_table.png" alt="MOSS-VL Benchmark Table" width="100%"/>
|
| 86 |
-
</p>
|
| 87 |
|
| 88 |
<p align="center">
|
| 89 |
<img src="assets/MOSS-VL-benchmark.png" alt="MOSS-VL Benchmark Results" width="100%"/>
|
|
@@ -92,7 +89,7 @@ We conducted a comprehensive evaluation of **MOSS-VL-Instruct-0408** across four
|
|
| 92 |
## ๐ Quickstart
|
| 93 |
### ๐ ๏ธ Requirements
|
| 94 |
|
| 95 |
-
Installation
|
| 96 |
|
| 97 |
```bash
|
| 98 |
conda create -n moss_vl python=3.12 pip -y
|
|
|
|
| 81 |
* **๐ง Robust Multimodal Reasoning**: MOSS-VL demonstrates solid logical inference, staying highly competitive with the latest Qwen series on challenging reasoning suites such as `CVBench` and `VisuLogic`.
|
| 82 |
* **๐ Reliable Document Understanding**: While the model is primarily optimized for general perception and video, MOSS-VL still delivers **83.9** on OCR and document analysis, ensuring dependable extraction of text and structured information.
|
| 83 |
|
|
|
|
|
|
|
|
|
|
| 84 |
|
| 85 |
<p align="center">
|
| 86 |
<img src="assets/MOSS-VL-benchmark.png" alt="MOSS-VL Benchmark Results" width="100%"/>
|
|
|
|
| 89 |
## ๐ Quickstart
|
| 90 |
### ๐ ๏ธ Requirements
|
| 91 |
|
| 92 |
+
Installation:
|
| 93 |
|
| 94 |
```bash
|
| 95 |
conda create -n moss_vl python=3.12 pip -y
|