kazandaev committed
Commit ef0d585 · 1 Parent(s): a893166

update model card README.md

Files changed (1)
  1. README.md +8 -10
README.md CHANGED
@@ -15,8 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model was trained from scratch on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.7716
- - Bleu: 13.1062
+ - Loss: 0.7470
+ - Bleu: 13.6896
  - Gen Len: 17.8687
 
  ## Model description
@@ -44,17 +44,15 @@ The following hyperparameters were used during training:
  - total_train_batch_size: 160
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 5
+ - num_epochs: 3
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
- |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|
- | 0.856 | 1.0 | 9641 | 0.8368 | 12.1924 | 17.8903 |
- | 0.8281 | 2.0 | 19282 | 0.8107 | 12.5703 | 17.8566 |
- | 0.8017 | 3.0 | 28923 | 0.7904 | 12.7893 | 17.8793 |
- | 0.7788 | 4.0 | 38564 | 0.7779 | 13.0086 | 17.8712 |
- | 0.7673 | 5.0 | 48205 | 0.7716 | 13.1062 | 17.8687 |
+ | Training Loss | Epoch | Step | Bleu | Gen Len | Validation Loss |
+ |:-------------:|:-----:|:-----:|:-------:|:-------:|:---------------:|
+ | 0.722 | 1.0 | 9641 | 13.3667 | 17.8985 | 0.7757 |
+ | 0.7098 | 2.0 | 19282 | 13.5842 | 17.8765 | 0.7571 |
+ | 0.7028 | 3.0 | 28923 | 13.6896 | 17.8687 | 0.7470 |
 
 
  ### Framework versions