ipa_ocr / README.md
metadata
license: mit
language:
  - zh
pipeline_tag: image-to-text

Convert images of IPA phonetic symbols to Pinyin.

Target: Convert scanned IPA symbols to Pinyin

The target data are scanned images of IPA phonetic symbols for Chengdunese (成都话) from The Great Dictionary of Modern Chinese Dialects (現代漢語方言大詞典).

TODO: add a labeled part of the test set.

Training and Test Set

  • 2,553 images of IPA phonetic symbols, generated from Pinyin pronunciations found in the Sichuanese Dialect Dictionary (四川方言词典 教你一口地道的四川话) and in the word list of the Shupin (蜀拼) input method.
  • 80/20 train/test split
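
As a rough illustration of what an 80/20 split of 2,553 images looks like, here is a minimal sketch. The file names, the shuffling, the seed, and the rounding rule are all assumptions made for this example; the README does not say how the actual split was produced.

```python
import random


def split_train_test(items, test_fraction=0.2, seed=0):
    """Shuffle items deterministically and split them into (train, test)."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical file names; the real dataset layout is not documented here.
image_names = [f"ipa_{i:04d}.png" for i in range(2553)]

train, test = split_train_test(image_names)
print(len(train), len(test))  # 2042 511
```

With this rounding rule the test set holds 511 images and the training set 2,042; a different rule (for example, flooring) would shift one image between the two.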

Results

  • Trained for 180 steps with a batch size of 32
  • Final Character Error Rate of 0.795%
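
For readers unfamiliar with the metric, character error rate is commonly defined as the total character-level edit distance between predictions and references, divided by the total number of reference characters. The sketch below follows that common definition; whether the number above was computed exactly this way is an assumption, and the example strings are made up for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # delete a reference character
                curr[j - 1] + 1,         # insert a hypothesis character
                prev[j - 1] + (r != h),  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def character_error_rate(references, hypotheses):
    """Corpus-level CER: total edits divided by total reference characters."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    return total_edits / total_chars


# Made-up Pinyin strings, not taken from the dataset.
references = ["ni3", "hao3"]
hypotheses = ["ni3", "hau3"]
cer = character_error_rate(references, hypotheses)
print(f"{cer:.3%}")  # one substitution over seven reference characters
```

Under this definition, a CER of 0.795% corresponds to roughly one character error per 126 reference characters.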