kushagra2004 commited on
Commit
83114ca
·
verified ·
1 Parent(s): ca6b2cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -1
README.md CHANGED
@@ -9,4 +9,30 @@ base_model:
9
  pipeline_tag: image-to-text
10
  tags:
11
  - finance
12
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  pipeline_tag: image-to-text
10
  tags:
11
  - finance
12
+ ---
13
+
14
+ ## Description
15
+
16
+ This repository provides three YOLO-based models intended to be used sequentially to extract and recognize digits from document images.
17
+
18
+ The pipeline works in three stages:
19
+
20
+ 1. **Region Segmentation**
21
+
22
+ * `segmenter.pt` (YOLOv11n)
23
+ * Finds the account number region in a document image
24
+ * Output is cropped and passed to the next stage
25
+
26
+ 2. **Digit Detection**
27
+
28
+ * `BBox.pt` (YOLOv11n)
29
+ * Detects bounding boxes for each digit within the cropped region
30
+ * Bounding boxes should be sorted left-to-right
31
+
32
+ 3. **Digit Classification**
33
+
34
+ * `Classify.pt` (YOLOv11s-cls)
35
+ * Classifies each cropped digit image into labels `0–9`
36
+ * Predictions are concatenated to form the final sequence
37
+
38
+ ---