lengocquangLAB committed on
Commit ce573a6 · verified · 1 Parent(s): d724694

End of training

Files changed (2)
  1. README.md +7 -6
  2. model.safetensors +1 -1
README.md CHANGED
@@ -19,15 +19,15 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [T5-small](https://huggingface.co/T5-small) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 10.2871
+ - Loss: 0.9927
  - Micro Precision: 0
  - Micro Recall: 0.0
  - Micro F1: 0
  - Macro Precision: 0.0
  - Macro Recall: 0.0
  - Macro F1: 0
- - Bleu: 0.0236
- - Rouge1: 0.0145
+ - Bleu: 0.0243
+ - Rouge1: 0.0133
  - Rouge2: 0.0
 
  ## Model description
@@ -48,8 +48,8 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 8
- - eval_batch_size: 8
+ - train_batch_size: 4
+ - eval_batch_size: 4
  - seed: 42
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
@@ -60,7 +60,8 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Micro Precision | Micro Recall | Micro F1 | Macro Precision | Macro Recall | Macro F1 | Bleu | Rouge1 | Rouge2 |
  |:-------------:|:------:|:----:|:---------------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|:--------:|:------:|:------:|:------:|
- | 15.4491 | 7.1429 | 50 | 10.2871 | 0 | 0.0 | 0 | 0.0 | 0.0 | 0 | 0.0236 | 0.0145 | 0.0 |
+ | 15.1542 | 3.8462 | 50 | 8.6615 | 0 | 0.0 | 0 | 0.0 | 0.0 | 0 | 0.0228 | 0.0126 | 0.0 |
+ | 4.6708 | 7.6923 | 100 | 0.9927 | 0 | 0.0 | 0 | 0.0 | 0.0 | 0 | 0.0243 | 0.0133 | 0.0 |
 
 
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3ef12ba2ecdc04ec6c615d79d2b87e5c1541e173c81bdec8fd110d2197ffb847
+ oid sha256:8bb6849fbeb917bd800e61b24cc52bcf16c3a181db7f91a387b401d5f6540fed
  size 242041896
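
A reading aid for the training-results rows in the README.md diff: the Epoch and Step columns together imply how many optimizer steps make up one epoch, which is a quick way to check that halving the batch size from 8 to 4 had the expected effect. The sketch below is not part of the commit. It assumes Epoch is reported as Step divided by steps per epoch, and that training ran on a single device without gradient accumulation (neither setting is visible in the diff). The helper names `steps_per_epoch` and `train_set_bounds` are hypothetical.

```python
def steps_per_epoch(step: int, epoch: float) -> int:
    """Infer optimizer steps per epoch from one logged (Step, Epoch) pair."""
    return round(step / epoch)


def train_set_bounds(steps: int, batch_size: int) -> tuple[int, int]:
    """Smallest and largest example counts that yield `steps` batches.

    Assumption: the last partial batch is kept, one device is used,
    and there is no gradient accumulation.
    """
    return batch_size * (steps - 1) + 1, batch_size * steps


# Values copied from the removed and added table rows.
old = steps_per_epoch(50, 7.1429)   # previous run, train_batch_size 8
new = steps_per_epoch(50, 3.8462)   # this commit, train_batch_size 4

print(old, train_set_bounds(old, 8))  # 7 (49, 56)
print(new, train_set_bounds(new, 4))  # 13 (49, 52)
```

Under those assumptions both runs are consistent with a training set of roughly 50 examples, so the extra logged row at step 100 comes from the smaller batch size rather than from more data.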