mayocream/manga-ocr


	---
	language: ja
	tags:
	- image-to-text
	license: apache-2.0
	datasets:
	- manga109s
	---

	# Manga OCR

	Optical character recognition for Japanese text, with a primary focus on Japanese manga.

	It uses the [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/vision-encoder-decoder) framework.
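As a rough illustration of what using the Vision Encoder Decoder framework can look like, the sketch below loads a checkpoint with `transformers` and decodes one image. Several things here are assumptions rather than facts from this card: the default `model_id` is simply this repository's name, and the repository is assumed to ship image-processor and tokenizer files alongside the weights. The `max_length=300` cap and the whitespace-stripping helper `clean_decoded` are illustrative choices, and the tokenizer may need extra Japanese tokenization packages installed.

```python
def clean_decoded(text: str) -> str:
    """Drop the whitespace a subword tokenizer may leave between decoded tokens."""
    return "".join(text.split())


def recognize(image_path: str, model_id: str = "mayocream/manga-ocr") -> str:
    """Run one image through a vision-encoder-decoder checkpoint (sketch)."""
    # Heavy imports stay local so clean_decoded can be used without them.
    import torch
    from PIL import Image
    from transformers import (
        AutoImageProcessor,
        AutoTokenizer,
        VisionEncoderDecoderModel,
    )

    # Assumption: model_id provides processor, tokenizer, and model files.
    processor = AutoImageProcessor.from_pretrained(model_id)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()

    # The encoder consumes pixel values; the decoder generates token ids.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values
    with torch.no_grad():
        token_ids = model.generate(pixel_values, max_length=300)

    decoded = tokenizer.decode(token_ids[0], skip_special_tokens=True)
    return clean_decoded(decoded)
```

Calling `recognize("panel.png")` with a hypothetical image path would download the checkpoint on first use and return the recognized string. The input is expected to be a crop containing a single text region rather than a full page, which is also an assumption here.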

	Manga OCR can be used as a general-purpose OCR for printed Japanese text, but its main goal is to provide high-quality
	text recognition that is robust to scenarios specific to manga:
	- both vertical and horizontal text
	- text with furigana
	- text overlaid on images
	- wide variety of fonts and font styles
	- low quality images

	Code is available [here](https://github.com/kha-white/manga_ocr).
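For readers who prefer the packaged code over loading the model by hand, the following is a minimal sketch of batch recognition with the `manga_ocr` Python package from the linked repository (installed with `pip install manga-ocr`). The helpers `find_images` and `ocr_folder` and the suffix list are hypothetical names introduced for this example. `MangaOcr()` is called with its defaults, so it is assumed to load the package's default checkpoint, which may not be the weights hosted in this repository.

```python
from pathlib import Path

# Hypothetical list of file types to treat as images.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def find_images(folder):
    """Return image files directly inside folder, sorted by path."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )


def ocr_folder(folder):
    """Map each image file name in folder to its recognized text."""
    # Imported lazily: the package pulls in torch and transformers.
    from manga_ocr import MangaOcr

    mocr = MangaOcr()  # loads the package's default model on first use
    return {p.name: mocr(str(p)) for p in find_images(folder)}
```

A call such as `ocr_folder("crops/")`, where `crops/` is a hypothetical directory of cropped text bubbles, would return a dictionary keyed by file name.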