FiveC/za_zh_sc

text2text-generation

Generated from Trainer

FiveC committed on Mar 14

Commit cb4f8db · verified · 1 Parent(s): f932c2c

End of training

Files changed (2)

README.md +5 -6
model.safetensors +1 -1

README.md CHANGED

@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/mbart-large-50-many-to-many-mmt](https://huggingface.co/facebook/mbart-large-50-many-to-many-mmt) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.6823
-- Sacrebleu: 6.0325
+- Loss: 2.9950
+- Sacrebleu: 4.4405
 ## Model description
@@ -43,16 +43,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
+- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|
-| 2.4359        | 1.0   | 309  | 2.9542          | 3.4303    |
-| 1.5757        | 2.0   | 618  | 2.7130          | 4.8701    |
-| 1.2767        | 3.0   | 927  | 2.6823          | 6.0325    |
+| 2.5108        | 1.0   | 309  | 2.9950          | 4.4405    |
+| 1.7678        | 2.0   | 618  | 2.8134          | 3.4752    |
 ### Framework versions
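
The headline numbers in the updated card (Loss 2.9950, Sacrebleu 4.4405) equal the epoch 1 row of the new table, not the final epoch 2 row. The following is a minimal sketch that checks this from the table rows added in this commit. The helper name `parse_markdown_table` is hypothetical, and the commit does not say why the card reports that row (for example, whether a best-checkpoint setting was used), so the script only shows which row the numbers coincide with.

```python
# Rows copied from the "+" side of the Training results table in this commit.
TABLE = """\
| Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
|:-------------:|:-----:|:----:|:---------------:|:---------:|
| 2.5108        | 1.0   | 309  | 2.9950          | 4.4405    |
| 1.7678        | 2.0   | 618  | 2.8134          | 3.4752    |
"""


def parse_markdown_table(text):
    """Parse a pipe-delimited markdown table into a list of dicts of floats.

    Hypothetical helper for illustration; assumes every data cell is numeric.
    """
    lines = [ln.strip() for ln in text.strip().splitlines()]
    header = [cell.strip() for cell in lines[0].strip("|").split("|")]
    rows = []
    for ln in lines[2:]:  # skip the header row and the alignment row
        cells = [float(cell) for cell in ln.strip("|").split("|")]
        rows.append(dict(zip(header, cells)))
    return rows


rows = parse_markdown_table(TABLE)
best_bleu = max(rows, key=lambda r: r["Sacrebleu"])        # highest Sacrebleu
best_loss = min(rows, key=lambda r: r["Validation Loss"])  # lowest validation loss
final = rows[-1]                                           # last evaluated epoch

print("best Sacrebleu row:", best_bleu)
print("lowest validation loss row:", best_loss)
print("final row:", final)
```

In this table the row with the highest Sacrebleu (epoch 1) and the row with the lowest validation loss (epoch 2) differ, so the two metrics disagree on which checkpoint is better.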

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2a8c993a3baf7a1c2f566869054bbc3fea35b85db960e122d8158e031ef54fe0
+oid sha256:e0e7cc73ae9b3435c2455575f9d3217bdf54f799f014084d890f3f42b524e902
 size 2444582788
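
The `model.safetensors` entry is a Git LFS pointer: only the `oid` changed, while `size` stayed at 2444582788 bytes. As a rough sketch, a downloaded weights file could be checked against such a pointer by comparing its byte size and SHA-256 digest. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any Hugging Face or Git LFS tooling.

```python
import hashlib
import os

# New pointer contents as shown in this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e0e7cc73ae9b3435c2455575f9d3217bdf54f799f014084d890f3f42b524e902
size 2444582788
"""


def parse_lfs_pointer(text):
    """Return (sha256_hex, size_in_bytes) from Git LFS pointer text."""
    # Each pointer line has the form "<key> <value>".
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return digest, int(fields["size"])


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Check that the file at `path` has the size and SHA-256 named by the pointer."""
    digest, size = parse_lfs_pointer(pointer_text)
    if os.path.getsize(path) != size:
        return False  # cheap size check before hashing a multi-gigabyte file
    hasher = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so the whole file is never held in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == digest
```

Assuming the weights were downloaded to a local path such as `model.safetensors`, calling `matches_pointer("model.safetensors", POINTER)` would return `True` only if the file corresponds to this commit's pointer.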