Yuantao Feng committed
Commit 7295415 (1 parent: 85b92e6)
Renaming model files to have more information on architecture, training data and more (#7)
* add suffix of training dataset, arch & upload time to each model
* update DB-IC15 benchmark results
README.md CHANGED
@@ -2,11 +2,13 @@

 An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

-
+Note:
+- Model source: https://docs.opencv.org/4.5.2/d9/d1e/tutorial_dnn_OCR.html.
+- For details on training this model, please visit https://github.com/zihaomu/deep-text-recognition-benchmark, which can only recognize english words.

 ## Demo

-***NOTE
+***NOTE***: This demo uses [text_detection_db](../text_detection_db) as text detector.

 Run the following command to try the demo:
 ```shell
demo.py CHANGED
@@ -26,7 +26,7 @@ def str2bool(v):
 parser = argparse.ArgumentParser(
     description="An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition (https://arxiv.org/abs/1507.05717)")
 parser.add_argument('--input', '-i', type=str, help='Path to the input image. Omit for using default camera.')
-parser.add_argument('--model', '-m', type=str, default='
+parser.add_argument('--model', '-m', type=str, default='text_recognition_CRNN_VGG_BiLSTM_CTC.onnx', help='Path to the model.')
 parser.add_argument('--width', type=int, default=736,
                     help='The width of input image being sent to the text detector.')
 parser.add_argument('--height', type=int, default=736,
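
The demo.py hunk changes only the default value of `--model`. As a minimal sketch (not the repository's full demo.py), the snippet below rebuilds just the arguments visible in the diff to show how that default resolves when `-m` is omitted and how it is overridden. The shortened description string and the file names `sample.jpg` and `my_model.onnx` are hypothetical placeholders, and the help text for `--height` is left out because the diff cuts it off.

```python
import argparse

# Sketch only: reproduces the arguments visible in the demo.py hunk.
# The description is shortened here; the real script uses the paper title.
parser = argparse.ArgumentParser(description="CRNN text recognition demo (sketch)")
parser.add_argument('--input', '-i', type=str,
                    help='Path to the input image. Omit for using default camera.')
parser.add_argument('--model', '-m', type=str,
                    default='text_recognition_CRNN_VGG_BiLSTM_CTC.onnx',
                    help='Path to the model.')
parser.add_argument('--width', type=int, default=736,
                    help='The width of input image being sent to the text detector.')
# The help string for --height is not shown in the diff, so none is given here.
parser.add_argument('--height', type=int, default=736)

# No arguments: every option falls back to its default.
defaults = parser.parse_args([])

# Hypothetical invocation overriding the input, the model path and the width.
custom = parser.parse_args(['-i', 'sample.jpg', '-m', 'my_model.onnx', '--width', '640'])

print(defaults.model)
print(custom.model, custom.width, custom.height)
```

Because `--input` has no default, `defaults.input` is `None`, which is the case the help text describes as falling back to the default camera.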