Opus Tatoeba | English -> Farsi
- dataset: opus
- model: transformer
- source language(s): eng
- target language(s): fas pes prs
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<(id = valid target language ID) - valid language labels: >>fas<< >>pes<< >>prs<<
- download: opus-2021-02-23.zip
- test set translations: opus-2021-02-23.test.txt
- test set scores: opus-2021-02-23.eval.txt
Benchmarks
| testset | BLEU | chr-F | #sent | #words | BP |
|---|---|---|---|---|---|
| Tatoeba-test.eng-fas | 11.8 | 0.364 | 7536 | 62270 | 0.924 |
| Tatoeba-test.eng-pes | 14.7 | 0.390 | 3763 | 31066 | 0.947 |
| Tatoeba-test.eng-pes_Latn | 0.9 | 0.000 | 3 | 26 | 0.741 |
| Tatoeba-test.eng-pes_Thaa | 0.9 | 0.003 | 2 | 40 | 1.000 |
| tico19-test.eng-fas | 13.7 | 0.422 | 2100 | 62758 | 0.826 |
- Downloads last month
- 5