NCUTNLP commited on
Commit
6d7452b
Β·
verified Β·
1 Parent(s): 8100673

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +92 -3
README.md CHANGED
@@ -1,3 +1,92 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # CrossLing-OCR-Mini
2
+
3
+ πŸš€ **CrossLing-OCR-Mini** is a lightweight yet powerful OCR model designed for **low-resource multilingual and complex-layout document scenarios**.
4
+ The model focuses on accurate text recognition while preserving original document structure, making it suitable for multilingual document understanding research.
5
+
6
+ ---
7
+
8
+ ## πŸ” Model Overview
9
+
10
+ CrossLing-OCR-Mini is optimized for **low-resource and structurally complex languages**, achieving strong performance across **11 languages** while remaining deployable on **consumer-grade hardware**.
11
+
12
+ **Key features:**
13
+ - Accurate text recognition with layout/format preservation
14
+ - Optimized for low-resource scripts
15
+ - Lightweight (~580MB) and easy to deploy
16
+ - Designed for research and benchmarking purposes
17
+
18
+ ### Supported & Optimized Languages
19
+ - High-resource: Chinese, English
20
+ - Low-resource (specially optimized):
21
+ **Tibetan, Mongolian, Kazakh, Kyrgyz, Zhuang**
22
+
23
+ Experimental results show that CrossLing-OCR-Mini **outperforms or matches mainstream OCR systems** on multiple low-resource languages.
24
+
25
+ ---
26
+
27
+ ## πŸ§ͺ Performance Notes & Limitations
28
+
29
+ While CrossLing-OCR-Mini achieves strong overall performance, we note that:
30
+ - **Mongolian and Uyghur** OCR accuracy still has room for improvement
31
+ - Performance may degrade in extremely noisy, handwritten, or out-of-distribution scenarios
32
+
33
+ These limitations will be addressed in future iterations of the model.
34
+
35
+ ---
36
+
37
+ ## πŸ“¦ Model Variants
38
+
39
+ | Version | Purpose | Availability |
40
+ |------|------|------|
41
+ | **CrossLing-OCR-Mini** | Research & academic use | βœ… Open-sourced |
42
+ | **CrossLing-OCR-Pro-Preview** | Commercial / production use | πŸ”’ Contact required |
43
+
44
+ πŸ“© For access to **CrossLing-OCR-Pro-Preview**, please contact:
45
+ **zhumx@ncut.edu.cn**
46
+
47
+ ---
48
+
49
+ ## 🎯 Intended Use
50
+
51
+ **This model is intended solely for:**
52
+ - Academic research
53
+ - Scientific experimentation
54
+ - Benchmarking and method comparison
55
+ - Low-resource language OCR studies
56
+
57
+ ---
58
+
59
+ ## 🚫 Prohibited Use & Disclaimer
60
+
61
+ This model **must not be used** for:
62
+ - Any illegal or unlawful activities
63
+ - Any applications that violate social ethics, public order, or applicable laws
64
+ - Surveillance, discrimination, or harmful decision-making systems
65
+
66
+ ⚠️ **Disclaimer**:
67
+ - Any misuse of this model is **strictly the responsibility of the user**
68
+ - The authors and maintainers **do not endorse** and are **not liable for** any consequences arising from improper or malicious use
69
+ - Views or actions enabled by this model **do not reflect the opinions of the authors**
70
+
71
+ ---
72
+
73
+ ## βš–οΈ License
74
+
75
+ This model is released **for research purposes only**.
76
+ Commercial use is **not permitted** without explicit authorization.
77
+
78
+ (Please contact the authors for commercial licensing or extended usage.)
79
+
80
+ ---
81
+
82
+ ## πŸ“– Citation
83
+
84
+ If you use CrossLing-OCR-Mini in your research, please cite:
85
+
86
+ ```bibtex
87
+ @misc{crossling-ocr-mini,
88
+ title = {CrossLing-OCR: Advancing Low-Resource Multilingual Text Recognition through Multi-Stage Vision-Language Training},
89
+ author = {CrossLing Team},
90
+ year = {2025},
91
+ note = {Research-only OCR model}
92
+ }