jumava commited on
Commit
5a6b07d
·
verified ·
1 Parent(s): 1a72b63

Training complete

Browse files
Files changed (1) hide show
  1. README.md +10 -10
README.md CHANGED
@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.0122
23
- - Bleu: 98.5517
24
- - Gen Len: 18.625
25
 
26
  ## Model description
27
 
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
44
  - train_batch_size: 8
45
  - eval_batch_size: 8
46
  - seed: 42
47
- - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
  - num_epochs: 2
50
 
@@ -52,13 +52,13 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
54
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
55
- | No log | 1.0 | 440 | 0.0188 | 98.4685 | 18.5625 |
56
- | 0.2267 | 2.0 | 880 | 0.0122 | 98.5517 | 18.625 |
57
 
58
 
59
  ### Framework versions
60
 
61
- - Transformers 4.51.3
62
- - Pytorch 2.6.0+cu124
63
- - Datasets 3.5.0
64
- - Tokenizers 0.21.1
 
19
 
20
  This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.0110
23
+ - Bleu: 98.5304
24
+ - Gen Len: 18.6146
25
 
26
  ## Model description
27
 
 
44
  - train_batch_size: 8
45
  - eval_batch_size: 8
46
  - seed: 42
47
+ - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
  - num_epochs: 2
50
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
54
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
55
+ | No log | 1.0 | 440 | 0.0163 | 98.8446 | 18.6146 |
56
+ | 0.2099 | 2.0 | 880 | 0.0110 | 98.5304 | 18.6146 |
57
 
58
 
59
  ### Framework versions
60
 
61
+ - Transformers 5.6.2
62
+ - Pytorch 2.10.0+cu128
63
+ - Datasets 4.8.4
64
+ - Tokenizers 0.22.2