Opus Tatoeba | English -> Farsi

  • dataset: opus
  • model: transformer
  • source language(s): eng
  • target language(s): fas pes prs
  • model: transformer
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • valid language labels: >>fas<< >>pes<< >>prs<<
  • download: opus-2021-02-23.zip
  • test set translations: opus-2021-02-23.test.txt
  • test set scores: opus-2021-02-23.eval.txt

Benchmarks

testset BLEU chr-F #sent #words BP
Tatoeba-test.eng-fas 11.8 0.364 7536 62270 0.924
Tatoeba-test.eng-pes 14.7 0.390 3763 31066 0.947
Tatoeba-test.eng-pes_Latn 0.9 0.000 3 26 0.741
Tatoeba-test.eng-pes_Thaa 0.9 0.003 2 40 1.000
tico19-test.eng-fas 13.7 0.422 2100 62758 0.826
Downloads last month
4
Safetensors
Model size
0.1B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support