Duplicated from kha-white/manga-ocr-base

agiera
/

manga-ocr-base

Feature Extraction

vision-encoder-decoder

Model card Files Files and versions

manga-ocr-base / README.md

agiera's picture

Update README.md

d0377f6 over 2 years ago

|

history blame contribute delete

748 Bytes

	---
	language: ja
	tags:
	- image-to-text
	- feature-extraction
	library_name: generic
	license: apache-2.0
	datasets:
	- manga109s
	---

	# Manga OCR

	Optical character recognition for Japanese text, with the main focus being Japanese manga.

	It uses [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/vision-encoder-decoder) framework.

	Manga OCR can be used as a general purpose printed Japanese OCR, but its main goal was to provide a high quality
	text recognition, robust against various scenarios specific to manga:
	- both vertical and horizontal text
	- text with furigana
	- text overlaid on images
	- wide variety of fonts and font styles
	- low quality images

	Code is available [here](https://github.com/kha-white/manga_ocr).