boatbomber commited on
Commit
c464cec
·
verified ·
1 Parent(s): 571841b

Add rough readme template for later

Browse files
Files changed (1) hide show
  1. README.md +89 -1
README.md CHANGED
@@ -12,4 +12,92 @@ tags:
12
  - Akkadian
13
  - PaddleOCR
14
  - PaddlePaddle
15
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  - Akkadian
13
  - PaddleOCR
14
  - PaddlePaddle
15
+ ---
16
+
17
+ # NabuOCR
18
+
19
+ *Ancient Cuneiform Meets Modern AI*
20
+
21
+ NabuOCR is a specialized OCR model for transliterating ancient cuneiform tablets directly from images to ATF (ASCII Transliteration Format). Named after Nabu, the Mesopotamian god of writing and scribes, this model bridges a 5,000-year gap between humanity's earliest writing system and cutting-edge computer vision.
22
+
23
+ ## Overview
24
+
25
+ NabuOCR processes images of cuneiform tablets and automatically generates scholarly transliterations in ATF format, the standard used by assyriologists worldwide. Built by fine-tuning PaddleOCR-VL on cuneiform tablet images, it can handle multiple views of tablets and produce complete transliterations including metadata.
26
+
27
+ ## Features
28
+
29
+ - **Multi-view Processing**: Handles obverse, reverse, and edge views of tablets
30
+ - **ATF Output**: Generates standard ATF format used by CDLI and other digital cuneiform projects
31
+ - **Robust Recognition**: Trained on diverse tablet conditions from multiple periods
32
+ - **Lightweight**: Based on the efficient 0.9B parameter PaddleOCR-VL model
33
+
34
+ ## Example Output
35
+
36
+ Given an image of a cuneiform tablet, NabuOCR generates:
37
+
38
+ ```
39
+ #atf: lang sux
40
+ @tablet
41
+ @obverse
42
+ 1. 1(disz) geme2 u4 1(disz)-sze3
43
+ 2. ki dingir-ra-ta
44
+ 3. da-da-ga
45
+ 4. szu ba-ti
46
+ @reverse
47
+ 1. mu ki-masz{ki} ba-hul
48
+ ```
49
+
50
+ ## Model Architecture
51
+
52
+ NabuOCR is built on PaddleOCR-VL, fine-tuned with:
53
+ - **Training Data**: [Specify dataset size] cuneiform tablet images from CDLI
54
+ - **Input Resolution**: 4096 max axis (automatically resized)
55
+ - **Output Format**: ATF standard transliteration
56
+ - **Languages Supported**: Sumerian (sux), Akkadian (akk), and other ancient Near Eastern languages
57
+
58
+ ## Usage Tips
59
+
60
+ ### Best Practices
61
+ - Provide high-resolution images when possible (minimum 800x800 recommended)
62
+ - Include all visible sides of the tablet in a single image or provide multiple views
63
+ - Ensure good lighting and contrast in photographs
64
+ - Remove excessive background from images
65
+
66
+
67
+ ## Performance
68
+
69
+ | Dataset | Character Accuracy | Line Accuracy | Full Tablet Accuracy |
70
+ |---------|-------------------|---------------|---------------------|
71
+ | Test Set | XX.X% | XX.X% | XX.X% |
72
+ | Old Babylonian | XX.X% | XX.X% | XX.X% |
73
+ | Neo-Assyrian | XX.X% | XX.X% | XX.X% |
74
+
75
+ ## Limitations
76
+
77
+ - Best performance on well-preserved tablets with clear impressions
78
+ - May struggle with heavily damaged or eroded sections
79
+ - Currently optimized for administrative and economic texts
80
+ - Limited support for complex literary texts with unusual sign variants
81
+
82
+ ## Citation
83
+
84
+ If you use NabuOCR in your research, please cite:
85
+
86
+ ```bibtex
87
+ @software{nabuocr2025,
88
+ title={NabuOCR: Neural Cuneiform Transliteration},
89
+ author={[Zack Williams]},
90
+ year={2025},
91
+ url={https://huggingface.co/boatbomber/NabuOCR}
92
+ }
93
+ ```
94
+
95
+ ## Acknowledgments
96
+
97
+ - Built on [PaddleOCR-VL](https://github.com/PaddlePaddle/PaddleOCR)
98
+ - Training data courtesy of the [Cuneiform Digital Library Initiative (CDLI)](https://cdli.ucla.edu/)
99
+ - ATF format specification from [ORACC](http://oracc.museum.upenn.edu/)
100
+
101
+ ---
102
+
103
+ *Bringing the ancient art of cuneiform into the age of artificial intelligence*