d2niraj555/mt5-eng2nep

text2text-generation

English to Nepali Translator

Nepali Translator Dataset


d2niraj555 committed on Oct 23, 2022

Commit 0f5087e · 1 parent: d91c131

update model card README.md

Files changed (1)

README.md +9 -5

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.4136
+- Loss: 2.1564
 ## Model description
@@ -42,15 +42,19 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 1
+- num_epochs: 2
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.9567        | 0.26  | 500  | 2.9944          |
-| 3.2601        | 0.53  | 1000 | 2.5355          |
-| 3.0768        | 0.79  | 1500 | 2.4136          |
+| 3.8723        | 0.26  | 500  | 2.9644          |
+| 3.2224        | 0.53  | 1000 | 2.5132          |
+| 2.9924        | 0.79  | 1500 | 2.3622          |
+| 2.842         | 1.05  | 2000 | 2.2799          |
+| 2.7975        | 1.32  | 2500 | 2.2186          |
+| 2.7526        | 1.58  | 3000 | 2.1765          |
+| 2.7554        | 1.84  | 3500 | 2.1564          |
 ### Framework versions
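
The numbers in the updated training-results table can be sanity-checked with a few lines of Python. The sketch below is not part of the commit. It assumes the reported losses are mean token-level cross-entropy in nats (so that `exp(loss)` is a perplexity), and the helper names `steps_per_epoch`, `perplexity`, and `loss_deltas` are hypothetical, introduced only for illustration. Because the Epoch column is rounded to two decimals, the steps-per-epoch figure is only an estimate.

```python
import math

# (epoch, step, validation_loss) rows copied from the "+" side of the diff.
rows = [
    (0.26, 500, 2.9644),
    (0.53, 1000, 2.5132),
    (0.79, 1500, 2.3622),
    (1.05, 2000, 2.2799),
    (1.32, 2500, 2.2186),
    (1.58, 3000, 2.1765),
    (1.84, 3500, 2.1564),
]

OLD_FINAL_LOSS = 2.4136        # evaluation loss on the "-" side (num_epochs: 1)
NEW_FINAL_LOSS = rows[-1][2]   # evaluation loss on the "+" side (num_epochs: 2)


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged row."""
    epoch, step, _ = rows[-1]
    return step / epoch


def perplexity(loss):
    """Convert a loss to perplexity, ASSUMING mean cross-entropy in nats."""
    return math.exp(loss)


def loss_deltas(rows):
    """Change in validation loss between consecutive evaluations."""
    losses = [loss for _, _, loss in rows]
    return [round(b - a, 4) for a, b in zip(losses, losses[1:])]


if __name__ == "__main__":
    print(f"approx. steps per epoch: {steps_per_epoch(rows):.0f}")
    print(f"perplexity before: {perplexity(OLD_FINAL_LOSS):.2f}")
    print(f"perplexity after:  {perplexity(NEW_FINAL_LOSS):.2f}")
    print("validation-loss deltas:", loss_deltas(rows))
```

Every delta is negative, so the validation loss was still falling at the last logged step, although the improvement per 500 steps shrinks steadily over the run.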