Opus Tatoeba | English -> Portuguese
- dataset: opus
- model: transformer
- source language(s): eng
- target language(s): pob por
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<< (id = valid target language ID)
- valid language labels: >>por<< >>pob<<
- download: opus-2021-02-18.zip
- test set translations: opus-2021-02-18.test.txt
- test set scores: opus-2021-02-18.eval.txt
Benchmarks
| testset |
BLEU |
chr-F |
#sent |
#words |
BP |
| Tatoeba-test.eng-por |
43.9 |
0.652 |
10000 |
75371 |
0.969 |