davron112 commited on
Commit
c01fc7f
·
verified ·
1 Parent(s): 7ac0199

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +52 -1
README.md CHANGED
@@ -7,4 +7,55 @@ tags:
7
  - kaa
8
  - qaraqalpaq
9
  - tesseract
10
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  - kaa
8
  - qaraqalpaq
9
  - tesseract
10
+ ---
11
+
12
+ # Tesseract OCR - Karakalpak Latin
13
+
14
+ Tesseract OCR trained data for Karakalpak language (Latin script).
15
+
16
+ ## Installation
17
+
18
+ Copy `kaa.traineddata` to your Tesseract tessdata directory:
19
+
20
+ ```bash
21
+ # Linux
22
+ cp kaa.traineddata /usr/share/tesseract-ocr/4.00/tessdata/
23
+
24
+ # macOS (Homebrew)
25
+ cp kaa.traineddata /opt/homebrew/share/tessdata/
26
+
27
+ # Windows
28
+ copy kaa.traineddata C:\Program Files\Tesseract-OCR\tessdata\
29
+ ```
30
+
31
+ ## Usage
32
+
33
+ ### CLI
34
+
35
+ ```bash
36
+ tesseract image.png output -l kaa
37
+ ```
38
+
39
+ ### Python
40
+
41
+ ```python
42
+ import pytesseract
43
+ from PIL import Image
44
+
45
+ text = pytesseract.image_to_string(Image.open('image.png'), lang='kaa')
46
+ print(text)
47
+ ```
48
+
49
+ ### Node.js
50
+
51
+ ```javascript
52
+ const Tesseract = require('tesseract.js');
53
+
54
+ Tesseract.recognize('image.png', 'kaa').then(({ data: { text } }) => {
55
+ console.log(text);
56
+ });
57
+ ```
58
+
59
+ ## Requirements
60
+
61
+ - Tesseract OCR 4.0+