HiTZ
/

cap-punct-eu

@@ -73,7 +73,7 @@ Normalized: informazio gehiago hitz puntu e hatxe u puntu eus web horrian
 ## Training
 ### Data preparation
-The training data was compiled by our research group from multiple heterogeneous sources and consists of approximately 9,784,905 sentences.
 Prior to training, the data underwent preprocessing steps including cleaning, punctuation standardization, filtering, and the creation of aligned input–output sentence pairs for the capitalization and punctuation restoration task.

 ## Training
 ### Data preparation
+The training data was compiled by our research group from multiple heterogeneous sources and consists of approximately 9,784,905 sentences. This dataset is a subset of the data used in the training of the following machine translation model [mt-hitz-eu-es](https://huggingface.co/HiTZ/mt-hitz-eu-es)
 Prior to training, the data underwent preprocessing steps including cleaning, punctuation standardization, filtering, and the creation of aligned input–output sentence pairs for the capitalization and punctuation restoration task.