Opus Tatoeba | English -> Portuguese
- dataset: opus
- model: transformer
- source language(s): eng
- target language(s): pob por
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence-initial language token is required in the form `>>id<<` (id = valid target language ID)
- valid language labels: `>>por<<` `>>pob<<`
- download: opus-2021-02-18.zip
- test set translations: opus-2021-02-18.test.txt
- test set scores: opus-2021-02-18.eval.txt
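The language-token requirement listed above can be illustrated with a small helper. This is a minimal sketch, not part of the released model: the function name `add_target_token` and the constant `VALID_TARGET_LABELS` are hypothetical, and the single space after the token is an assumption about the expected input format. It only builds the prefixed input string; tokenization and translation are out of scope here.

```python
# Hypothetical helper: prepend the required target-language token to a source sentence.
# The label set is taken from the "valid language labels" line of this card.
VALID_TARGET_LABELS = {"por", "pob"}


def add_target_token(sentence: str, target: str) -> str:
    """Return the sentence prefixed with a >>id<< token for the given target label."""
    if target not in VALID_TARGET_LABELS:
        raise ValueError(
            f"unknown target label {target!r}; expected one of {sorted(VALID_TARGET_LABELS)}"
        )
    # Assumption: the token is separated from the text by a single space.
    return f">>{target}<< {sentence.strip()}"


if __name__ == "__main__":
    for label in ("por", "pob"):
        print(add_target_token("How are you today?", label))
```

The prefixed strings would then be passed to whatever tokenizer and model wrapper is used to load the checkpoint.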
Benchmarks
| testset | BLEU | chr-F | #sent | #words | BP |
|---|---|---|---|---|---|
| Tatoeba-test.eng-por | 43.9 | 0.652 | 10000 | 75371 | 0.969 |
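The BP column is presumably the BLEU brevity penalty. As a rough sketch, assuming the standard BLEU definition (BP = 1 when the system output is at least as long as the reference, otherwise exp(1 - r/c) for reference length r and output length c), the snippet below computes the penalty and inverts it to estimate the output-to-reference length ratio. The function names are illustrative and do not come from the evaluation scripts behind this table.

```python
import math


def brevity_penalty(hyp_len: float, ref_len: float) -> float:
    """Standard BLEU brevity penalty for hypothesis length c and reference length r."""
    if hyp_len <= 0:
        return 0.0
    if hyp_len >= ref_len:
        return 1.0
    return math.exp(1.0 - ref_len / hyp_len)


def length_ratio_from_bp(bp: float) -> float:
    """Invert the penalty: return c/r implied by a given BP.

    For BP == 1 the result is 1.0, which is only a lower bound on the true ratio.
    """
    if not 0.0 < bp <= 1.0:
        raise ValueError("BP must be in (0, 1]")
    return 1.0 / (1.0 - math.log(bp))


if __name__ == "__main__":
    # BP value reported for Tatoeba-test.eng-por in the table above.
    ratio = length_ratio_from_bp(0.969)
    print(f"implied output/reference length ratio: {ratio:.3f}")
```

Under that assumption, a BP of 0.969 corresponds to system output roughly 3% shorter than the reference.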