opus-mt-en-pt

Overview

This repository contains the English-to-Portuguese OPUS-MT transformer model from Helsinki-NLP/opus-mt-tc-big-en-pt.

Source

Source model: Helsinki-NLP/opus-mt-tc-big-en-pt

The upstream model card describes this as a neural machine translation model for translating from English (en) to Portuguese (pt). It is part of the OPUS-MT project and was trained with OPUS data using Marian NMT, then converted for use with Transformers.

Files

File                       Description
model.safetensors          Model weights
config.json                Marian model configuration
generation_config.json     Text generation configuration
source.spm                 Source SentencePiece model
target.spm                 Target SentencePiece model
vocab.json                 Vocabulary
tokenizer_config.json      Tokenizer configuration
special_tokens_map.json    Special token mapping
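
A local copy of the repository can be checked against the file list above before loading. The following is a minimal sketch, not part of the repository: the helper name `missing_files` and the `EXPECTED_FILES` constant are hypothetical, and only the file names come from the list.

```python
from pathlib import Path

# File names copied from the Files list; everything else here is illustrative.
EXPECTED_FILES = [
    "model.safetensors",
    "config.json",
    "generation_config.json",
    "source.spm",
    "target.spm",
    "vocab.json",
    "tokenizer_config.json",
    "special_tokens_map.json",
]


def missing_files(model_dir):
    """Return the expected file names that are absent from model_dir."""
    root = Path(model_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]
```

An empty list from `missing_files("path/to/local/copy")` suggests the download is complete; it does not validate the contents of the files.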

Intended Use

Use this model for English-to-Portuguese machine translation with Transformers-compatible Marian tooling.
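
A minimal sketch of loading the model with the Marian classes in Transformers follows. Several things here are assumptions rather than documented facts about this repository: the repository ID `tonythethompson/opus-mt-en-pt` is taken from the hosting page, the function names are hypothetical, and the `>>por<<` prefix reflects the sentence-initial target-language token convention of the upstream model as best understood here, so it should be checked against the upstream card. The sketch also assumes `torch` and `sentencepiece` are installed.

```python
def with_target_token(sentences, token=">>por<<"):
    """Prefix each sentence with a target-language token.

    Sentences that already start with a ">>...<<" style token are left as-is.
    The default token is an assumption; check the upstream model card.
    """
    return [s if s.startswith(">>") else f"{token} {s}" for s in sentences]


def translate(sentences, model_id="tonythethompson/opus-mt-en-pt"):
    """Translate a list of English sentences to Portuguese."""
    # Imported lazily so with_target_token works without transformers installed.
    from transformers import MarianMTModel, MarianTokenizer

    tokenizer = MarianTokenizer.from_pretrained(model_id)
    model = MarianMTModel.from_pretrained(model_id)

    # Pad to the longest sentence so the batch forms a rectangular tensor.
    batch = tokenizer(with_target_token(sentences), return_tensors="pt", padding=True)
    generated = model.generate(**batch)
    return tokenizer.batch_decode(generated, skip_special_tokens=True)
```

Calling `translate(["How are you today?"])` downloads the weights on first use and returns a list with one translated string. A local directory containing the repository files can be passed as `model_id` instead of the hub ID.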

Training Data

The upstream model card states that training data is taken from OPUS.

Evaluation

The upstream model card lists the following self-reported BLEU scores for English-to-Portuguese translation:

Test Set                    BLEU
flores101-devtest           50.4
tatoeba-test-v2021-08-07    49.6
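
To check scores like these independently, corpus-level BLEU can be computed with the sacreBLEU library. This is a hedged sketch only: the function name `corpus_bleu_score` is hypothetical, it assumes `sacrebleu` is installed, and it says nothing about the exact settings or preprocessing used upstream, so results may not match the reported numbers.

```python
def corpus_bleu_score(hypotheses, references):
    """Return corpus-level BLEU for system outputs against one reference each.

    hypotheses: list of translated sentences (model output).
    references: list of reference translations, aligned with hypotheses.
    """
    if len(hypotheses) != len(references):
        raise ValueError("hypotheses and references must have the same length")

    # Imported lazily so the length check runs without sacrebleu installed.
    import sacrebleu

    # sacreBLEU expects a list of reference streams; there is a single stream here.
    return sacrebleu.corpus_bleu(hypotheses, [references]).score
```

The hypotheses would come from translating the source side of a test set with this model, and the references from the same test set's Portuguese side.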

Limitations

  • Evaluation scores are inherited from the upstream model card and are not independently verified here.
  • Translation quality can vary by domain and input style.
  • This repository does not document additional fine-tuning or conversion beyond the upstream source model files.

License

CC-BY-4.0, matching the upstream source model metadata.
