Update README.md
README.md
ADDED
@@ -0,0 +1,20 @@
### opus-mt-de-ZH

* source languages: de
* target languages: cmn,cn,yue,ze_zh,zh_cn,zh_CN,zh_HK,zh_tw,zh_TW,zh_yue,zhs,zht,zh
* OPUS readme: [de-cmn+cn+yue+ze_zh+zh_cn+zh_CN+zh_HK+zh_tw+zh_TW+zh_yue+zhs+zht+zh](https://github.com/Helsinki-NLP/OPUS-MT-train/blob/master/models/de-cmn+cn+yue+ze_zh+zh_cn+zh_CN+zh_HK+zh_tw+zh_TW+zh_yue+zhs+zht+zh/README.md)

* dataset: opus
* model: transformer-align
* pre-processing: normalization + SentencePiece
* a sentence initial language token is required in the form of `>>id<<` (id = valid target language ID)
* download original weights: [opus-2020-01-20.zip](https://object.pouta.csc.fi/OPUS-MT-models/de-cmn+cn+yue+ze_zh+zh_cn+zh_CN+zh_HK+zh_tw+zh_TW+zh_yue+zhs+zht+zh/opus-2020-01-20.zip)
* test set translations: [opus-2020-01-20.test.txt](https://object.pouta.csc.fi/OPUS-MT-models/de-cmn+cn+yue+ze_zh+zh_cn+zh_CN+zh_HK+zh_tw+zh_TW+zh_yue+zhs+zht+zh/opus-2020-01-20.test.txt)
* test set scores: [opus-2020-01-20.eval.txt](https://object.pouta.csc.fi/OPUS-MT-models/de-cmn+cn+yue+ze_zh+zh_cn+zh_CN+zh_HK+zh_tw+zh_TW+zh_yue+zhs+zht+zh/opus-2020-01-20.eval.txt)
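
The language-token requirement in the list above can be sketched as a small helper that prepends `>>id<<` to each German source sentence before it is tokenized. This is a minimal illustration, not part of the model's own tooling: the names `TARGET_LANGUAGE_IDS` and `add_language_token` are hypothetical, the ID list is copied from the "target languages" line, and the single space between the token and the sentence is an assumption.

```python
# Hypothetical helper: IDs copied from the "target languages" line above.
TARGET_LANGUAGE_IDS = (
    "cmn", "cn", "yue", "ze_zh", "zh_cn", "zh_CN", "zh_HK",
    "zh_tw", "zh_TW", "zh_yue", "zhs", "zht", "zh",
)


def add_language_token(sentence: str, target_id: str) -> str:
    """Prefix a source sentence with the >>id<< target language token."""
    if target_id not in TARGET_LANGUAGE_IDS:
        raise ValueError(f"unknown target language ID: {target_id!r}")
    # Assumption: token and sentence are separated by one space.
    return f">>{target_id}<< {sentence.strip()}"


if __name__ == "__main__":
    print(add_language_token("Guten Morgen.", "zh_CN"))
```

The returned string is what would be passed to the tokenizer. Rejecting unknown IDs up front is a design choice made here so that a mistyped target ID fails loudly instead of being treated as ordinary text.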

## Benchmarks

| testset               | BLEU  | chr-F |
|-----------------------|-------|-------|
| bible-uedin.de.zh     | 24.4  | 0.335 |