jumava
/

mbart-neutralization

text2text-generation

Generated from Trainer

Model card Files Files and versions

jumava commited on Apr 25

Commit

5a6b07d

·

verified ·

1 Parent(s): 1a72b63

Training complete

Files changed (1) hide show

README.md +10 -10

README.md CHANGED Viewed

@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0122
-- Bleu: 98.5517
-- Gen Len: 18.625
 ## Model description
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
-- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 2
@@ -52,13 +52,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
-| No log        | 1.0   | 440  | 0.0188          | 98.4685 | 18.5625 |
-| 0.2267        | 2.0   | 880  | 0.0122          | 98.5517 | 18.625  |
 ### Framework versions
-- Transformers 4.51.3
-- Pytorch 2.6.0+cu124
-- Datasets 3.5.0
-- Tokenizers 0.21.1

 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0110
+- Bleu: 98.5304
+- Gen Len: 18.6146
 ## Model description
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 2
 | Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| No log        | 1.0   | 440  | 0.0163          | 98.8446 | 18.6146 |
+| 0.2099        | 2.0   | 880  | 0.0110          | 98.5304 | 18.6146 |
 ### Framework versions
+- Transformers 5.6.2
+- Pytorch 2.10.0+cu128
+- Datasets 4.8.4
+- Tokenizers 0.22.2