A newer version of this model is available: hezarai/crnn-base-fa-v2

A CRNN model for Persian OCR. This model is based on a simple CNN + LSTM architecture inspired by this paper. More info about data and training will be provided soon.

Note that this model is only optimized for scanned documents and supports input characters of up to 32 (For an end-to-end OCR pipeline, use a text detector model first to extract text boxes preferrably in word-level and then use this model), but it can be used to be fine-tuned on other domains like license plate or handwritten texts.

Limitations

This model is best suited for Persian alphabet and lacks the ability to recognize numbers and digits properly. We'll soon retrain this model to fit all scenarios.

Usage

pip install hezar

from hezar.models import Model

crnn = Model.load("hezarai/crnn-base-fa-v1")
texts = crnn.predict(["sample_image.jpg"])
print(texts)

Downloads last month: 90

Collection including hezarai/crnn-base-fa-v1

Computer Vision

Collection

Computer vision models, datasets, etc. • 10 items • Updated 5 days ago

Paper for hezarai/crnn-base-fa-v1

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Paper • 1507.05717 • Published Jul 21, 2015