---
license: mit
language:
- zh
pipeline_tag: image-to-text
---

# Target: Convert Scanned Images of IPA Symbols to Pinyin
The target inputs are scanned images of IPA phonetic symbols for Chengdunese (成都话) from The Great Dictionary of Modern Chinese Dialects (現代漢語方言大詞典). The model should output the corresponding Pinyin.

# Training and Test Set
* 2,553 images of IPA phonetic symbols, generated from Pinyin pronunciations found in the Sichuanese Dialect Dictionary (四川方言词典 教你一口地道的四川话) and in the word list of the Shupin (蜀拼) input method.
* Split 80/20 into training and test sets.
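
An 80/20 split like the one above can be sketched as follows. This is only an illustration: the function name `split_train_test`, the fixed seed, and the rounding rule are assumptions, not the procedure actually used to build this dataset.

```python
import random


def split_train_test(items, train_fraction=0.8, seed=0):
    """Shuffle a copy of `items` and split it into (train, test).

    Hypothetical helper: the seed and the round-down rule for the
    training size are assumptions made for this sketch.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # deterministic shuffle
    n_train = int(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]


# Stand-in sample IDs for the 2,553 generated images.
sample_ids = list(range(2553))
train_ids, test_ids = split_train_test(sample_ids)
print(len(train_ids), len(test_ids))
```

Under this rounding rule the split yields 2,042 training and 511 test samples; a different rounding rule would shift the boundary by one sample.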

# Results
* Trained for 180 steps with a batch size of 32.
* Final character error rate (CER) of 0.795% on the test set.
* TODO: label a subset of the scanned images to check whether the model generalizes to the target task.
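
For reference, character error rate is commonly defined as the total character-level edit distance between predictions and references, divided by the total number of reference characters. The sketch below follows that common definition. It is an assumption that the figure reported above was computed the same way, and the function names and sample strings are hypothetical.

```python
def edit_distance(reference, hypothesis):
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def character_error_rate(references, hypotheses):
    """Corpus-level CER: total edits divided by total reference characters."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    return total_edits / total_chars


# Hypothetical Pinyin strings, for illustration only.
refs = ["cen2du1", "si4cuan1"]
hyps = ["cen2du1", "si4cuan2"]
print(f"CER: {character_error_rate(refs, hyps):.3%}")
```

The same function could be applied to the scanned images once a labeled subset exists, which would give a number directly comparable to the test-set figure.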