zuminghuang committed on
Commit
3ea069e
·
verified ·
1 Parent(s): 8af0881

Update README.md

Files changed (1)
  1. README.md +67 -3
README.md CHANGED
@@ -1,3 +1,67 @@
- ---
- license: apache-2.0
- ---
+ # Infinity-Parser2-2B
+
+ <p align="center">
+ <img src="assets/logo.png" width="400"/>
+ </p>
+
+ <p align="center">
+ 💻 <a href="https://github.com/infly-ai/INF-MLLM">GitHub</a> |
+ 📊 <a href="https://huggingface.co/datasets/infly/Infinity-Doc2-xxx">Dataset</a> |
+ 📄 <a>Paper (coming soon...)</a> |
+ 🚀 <a>Demo (coming soon...)</a>
+ </p>
+
+ # Introduction
+
+
+ ## Key Features
+
+ # Architecture
+
+ todo
+
+ # Performance
+
+ ## Document Parsing
+ ![image](assets/xxx.png)
+
+ ## Table Parsing
+ ![image](assets/xxx.png)
+
+ ## Math Formula Parsing
+ ![image](assets/xxx.png)
+
+ ## Chart Parsing
+ ![image](assets/xxx.png)
+
+ ## Chemical Formula Parsing
+ ![image](assets/xxx.png)
+
+ ## General Multimodal Understanding
+ ![image](assets/xxx.png)
+
+ # Quick Start
+
+ todo
+
+ # Visualization
+
+ ## Comparison Examples
+ ![image](assets/xxx.jpeg)
+
+ # Limitations & Future Work
+
+ ## Limitations
+
+ ## Future Work
+
+ # Acknowledgments
+ We would like to thank [Qwen3-VL](https://github.com/QwenLM/Qwen3-VL), [ms-swift](https://github.com/modelscope/ms-swift), [verl](https://github.com/verl-project/verl), [olmOCR-bench](https://huggingface.co/datasets/allenai/olmOCR-bench), and [OmniDocBench](https://github.com/opendatalab/OmniDocBench) for providing data, code, and models.
+
+ # Citation
+
+ Coming soon...
+
+ # License
+
+ This model is licensed under the Apache License 2.0.