Upload README.md
Browse files
README.md
CHANGED
|
@@ -2,21 +2,19 @@
|
|
| 2 |
<img src="imgs/logo.jpg" width="80%" >
|
| 3 |
</div>
|
| 4 |
|
| 5 |
-
|
| 6 |
<p align="center">
|
| 7 |
-
๐ป <a href="https://logics.alibaba-inc.com/parsing/?spm=label.2ef5001f.0.0.1c702159dQbTRd">HomePage</a>   |   ๐ค <a href="https://github.com/alibaba/Logics-Parsing">GitHub</a>   |   ๐ค <a href="https://www.modelscope.cn/studios/Alibaba-DT/
|
| 8 |
</p>
|
| 9 |
|
| 10 |
|
| 11 |
<div align="center">
|
| 12 |
-
<img src="imgs/
|
| 13 |
</div>
|
| 14 |
-
|
| 15 |
<div align="center">
|
| 16 |
-
<img src="imgs/
|
| 17 |
</div>
|
| 18 |
|
| 19 |
-
|
| 20 |
## Updates
|
| 21 |
* [2026/02/13] ๐๐๐๐๐ We release Logics-Parsing-v2 Model.
|
| 22 |
* [2025/09/25] ๐๐๐We release Logics-Parsing Model.
|
|
@@ -26,10 +24,12 @@
|
|
| 26 |
**Logics-Parsing-v2** is an advanced evolution of the previously proposed Logics-Parsing (v1). It inherits all the core capabilities of v1 model, while demonstrating more powerful capabilities on handling complex documents. Furthermore, it extends support for **Parsing-2.0** scenarios, enabling structured parsing of musical sheets, flowcharts, as well as code/pseudocode blocks.
|
| 27 |
|
| 28 |
<div align="center">
|
| 29 |
-
<img src="imgs/overview.png" alt="LogicsDocBench ๆฆ่ง" style="width: 800px; height:
|
| 30 |
</div>
|
| 31 |
|
|
|
|
| 32 |
## Key Features
|
|
|
|
| 33 |
* **Effortless End-to-End Processing**
|
| 34 |
|
| 35 |
* End-to-end recognition and parsing for various kinds of document elements within a single model.
|
|
@@ -51,6 +51,7 @@
|
|
| 51 |
|
| 52 |
|
| 53 |
## Benchmark
|
|
|
|
| 54 |
### Comparisons on LogicsDocBench
|
| 55 |
|
| 56 |
We introduce **LogicsDocBench**, a new comprehensive evaluation benchmark comprising 900 carefully selected PDF pages, covering both traditional document Parsing-1.0 tasks and the newly introduced Parsing-2.0 scenarios. This benchmark is designed to better assess modelsโ capabilities in complex and diverse real-world documents parsing. The dataset is organized into three core document subsets:
|
|
@@ -85,7 +86,7 @@ The histogram below provides a more intuitive visualization of the advantages of
|
|
| 85 |
|
| 86 |
<div align="center">
|
| 87 |
<img src="imgs/benchmark_clean_morandi_split.png" width="100%" >
|
| 88 |
-
</div>
|
| 89 |
|
| 90 |
|
| 91 |
### Comparisons on OmniDocBench_v1.5
|
|
@@ -100,6 +101,7 @@ _\* The model results in the table are sourced from the official OmniDocBench we
|
|
| 100 |
|
| 101 |
|
| 102 |
## Quick Start
|
|
|
|
| 103 |
### 1. Installation
|
| 104 |
```shell
|
| 105 |
conda create -n logis-parsing-v2 python=3.10
|
|
@@ -154,6 +156,7 @@ python3 inference_v2.py --image_path PATH_TO_INPUT_IMG --output_path PATH_TO_OUT
|
|
| 154 |
|
| 155 |
|
| 156 |
|
|
|
|
| 157 |
## Acknowledgments
|
| 158 |
|
| 159 |
|
|
|
|
| 2 |
<img src="imgs/logo.jpg" width="80%" >
|
| 3 |
</div>
|
| 4 |
|
|
|
|
| 5 |
<p align="center">
|
| 6 |
+
๐ป <a href="https://logics.alibaba-inc.com/parsing/?spm=label.2ef5001f.0.0.1c702159dQbTRd">HomePage</a>   |   ๐ค <a href="https://github.com/alibaba/Logics-Parsing">GitHub</a>   |   ๐ค <a href="https://www.modelscope.cn/studios/Alibaba-DT/Logics-Parsing/summary">Demo</a>
|
| 7 |
</p>
|
| 8 |
|
| 9 |
|
| 10 |
<div align="center">
|
| 11 |
+
<img src="imgs/benchmark_clean_morandi_logicsdocbench.png" alt="LogicsDocBench results" style="width: 800px; height: auto;">
|
| 12 |
</div>
|
| 13 |
+
<br><br>
|
| 14 |
<div align="center">
|
| 15 |
+
<img src="imgs/benchmark_clean_morandi_omni.png" alt="OmniDocBench-v1.5 results" style="width: 800px; height: auto;">
|
| 16 |
</div>
|
| 17 |
|
|
|
|
| 18 |
## Updates
|
| 19 |
* [2026/02/13] ๐๐๐๐๐ We release Logics-Parsing-v2 Model.
|
| 20 |
* [2025/09/25] ๐๐๐We release Logics-Parsing Model.
|
|
|
|
| 24 |
**Logics-Parsing-v2** is an advanced evolution of the previously proposed Logics-Parsing (v1). It inherits all the core capabilities of v1 model, while demonstrating more powerful capabilities on handling complex documents. Furthermore, it extends support for **Parsing-2.0** scenarios, enabling structured parsing of musical sheets, flowcharts, as well as code/pseudocode blocks.
|
| 25 |
|
| 26 |
<div align="center">
|
| 27 |
+
<img src="imgs/overview.png" alt="LogicsDocBench ๆฆ่ง" style="width: 800px; height: auto;">
|
| 28 |
</div>
|
| 29 |
|
| 30 |
+
|
| 31 |
## Key Features
|
| 32 |
+
|
| 33 |
* **Effortless End-to-End Processing**
|
| 34 |
|
| 35 |
* End-to-end recognition and parsing for various kinds of document elements within a single model.
|
|
|
|
| 51 |
|
| 52 |
|
| 53 |
## Benchmark
|
| 54 |
+
|
| 55 |
### Comparisons on LogicsDocBench
|
| 56 |
|
| 57 |
We introduce **LogicsDocBench**, a new comprehensive evaluation benchmark comprising 900 carefully selected PDF pages, covering both traditional document Parsing-1.0 tasks and the newly introduced Parsing-2.0 scenarios. This benchmark is designed to better assess modelsโ capabilities in complex and diverse real-world documents parsing. The dataset is organized into three core document subsets:
|
|
|
|
| 86 |
|
| 87 |
<div align="center">
|
| 88 |
<img src="imgs/benchmark_clean_morandi_split.png" width="100%" >
|
| 89 |
+
</div><br>
|
| 90 |
|
| 91 |
|
| 92 |
### Comparisons on OmniDocBench_v1.5
|
|
|
|
| 101 |
|
| 102 |
|
| 103 |
## Quick Start
|
| 104 |
+
|
| 105 |
### 1. Installation
|
| 106 |
```shell
|
| 107 |
conda create -n logis-parsing-v2 python=3.10
|
|
|
|
| 156 |
|
| 157 |
|
| 158 |
|
| 159 |
+
|
| 160 |
## Acknowledgments
|
| 161 |
|
| 162 |
|