Update README.md
README.md
CHANGED
@@ -12,7 +12,8 @@ library_name: fairseq
 ## Model description
 
 This model was trained from scratch using the [Fairseq toolkit](https://fairseq.readthedocs.io/en/latest/) on a combination of Basque-Catalan datasets
-totalling approximately 75 million sentence pairs.Parallel Basque-Catalan data was collected from
+totalling approximately 75 million sentence pairs.Parallel Basque-Catalan data was collected from Opus and additional synthetic data was created from
+the [Projecte Aina ES-CA corpus](https://huggingface.co/projecte-aina/mt-aina-ca-es) by translating the Spanish side using the ES-EU translator of [HiTZ](http://hitz.eus/). The model was evaluated on the Flores and NTREX evaluation datasets.
 
 ## Intended uses and limitations
 