TheBug95 commited on
Commit
83d7e2c
·
verified ·
1 Parent(s): 28c5137

End of training

Browse files
README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.8882
21
- - Rouge1: 0.4218
22
- - Rouge2: 0.3111
23
- - Rougel: 0.4018
24
- - Rougelsum: 0.4023
25
- - Gen Len: 18.9135
26
 
27
  ## Model description
28
 
@@ -41,7 +41,7 @@ More information needed
41
  ### Training hyperparameters
42
 
43
  The following hyperparameters were used during training:
44
- - learning_rate: 2e-05
45
  - train_batch_size: 16
46
  - eval_batch_size: 16
47
  - seed: 42
@@ -54,26 +54,26 @@ The following hyperparameters were used during training:
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
57
- | No log | 1.0 | 299 | 1.0700 | 0.4139 | 0.3012 | 0.3938 | 0.3942 | 18.9323 |
58
- | 1.2654 | 2.0 | 598 | 1.0254 | 0.4179 | 0.3044 | 0.3981 | 0.3982 | 18.9323 |
59
- | 1.2654 | 3.0 | 897 | 0.9932 | 0.4206 | 0.3073 | 0.4006 | 0.4008 | 18.9305 |
60
- | 1.1542 | 4.0 | 1196 | 0.9763 | 0.4214 | 0.308 | 0.4011 | 0.4015 | 18.9267 |
61
- | 1.1542 | 5.0 | 1495 | 0.9559 | 0.4221 | 0.3099 | 0.4025 | 0.4027 | 18.9211 |
62
- | 1.1025 | 6.0 | 1794 | 0.9474 | 0.4222 | 0.3103 | 0.4027 | 0.4029 | 18.9211 |
63
- | 1.0741 | 7.0 | 2093 | 0.9354 | 0.4229 | 0.3108 | 0.4032 | 0.4035 | 18.9211 |
64
- | 1.0741 | 8.0 | 2392 | 0.9273 | 0.422 | 0.3108 | 0.4024 | 0.4026 | 18.9211 |
65
- | 1.0521 | 9.0 | 2691 | 0.9206 | 0.4232 | 0.3121 | 0.4036 | 0.4038 | 18.9211 |
66
- | 1.0521 | 10.0 | 2990 | 0.9144 | 0.4229 | 0.3122 | 0.4034 | 0.4037 | 18.9173 |
67
- | 1.0354 | 11.0 | 3289 | 0.9089 | 0.4234 | 0.3134 | 0.4039 | 0.4043 | 18.9192 |
68
- | 1.0235 | 12.0 | 3588 | 0.9038 | 0.4233 | 0.3132 | 0.4039 | 0.4043 | 18.9135 |
69
- | 1.0235 | 13.0 | 3887 | 0.8994 | 0.4223 | 0.3124 | 0.4029 | 0.4035 | 18.9135 |
70
- | 1.0042 | 14.0 | 4186 | 0.8955 | 0.4225 | 0.3123 | 0.403 | 0.4034 | 18.9135 |
71
- | 1.0042 | 15.0 | 4485 | 0.8934 | 0.4225 | 0.3123 | 0.403 | 0.4035 | 18.9135 |
72
- | 1.0049 | 16.0 | 4784 | 0.8924 | 0.4223 | 0.3121 | 0.4029 | 0.4034 | 18.9135 |
73
- | 0.994 | 17.0 | 5083 | 0.8906 | 0.4223 | 0.3116 | 0.4028 | 0.4032 | 18.9098 |
74
- | 0.994 | 18.0 | 5382 | 0.8886 | 0.4223 | 0.3117 | 0.4026 | 0.4029 | 18.9135 |
75
- | 0.9899 | 19.0 | 5681 | 0.8885 | 0.4218 | 0.3111 | 0.4018 | 0.4023 | 18.9135 |
76
- | 0.9899 | 20.0 | 5980 | 0.8882 | 0.4218 | 0.3111 | 0.4018 | 0.4023 | 18.9135 |
77
 
78
 
79
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.8614
21
+ - Rouge1: 0.422
22
+ - Rouge2: 0.3103
23
+ - Rougel: 0.4017
24
+ - Rougelsum: 0.4019
25
+ - Gen Len: 18.9192
26
 
27
  ## Model description
28
 
 
41
  ### Training hyperparameters
42
 
43
  The following hyperparameters were used during training:
44
+ - learning_rate: 3.419313942464226e-05
45
  - train_batch_size: 16
46
  - eval_batch_size: 16
47
  - seed: 42
 
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
57
+ | No log | 1.0 | 239 | 1.0311 | 0.418 | 0.304 | 0.3985 | 0.3988 | 18.9267 |
58
+ | No log | 2.0 | 478 | 1.0058 | 0.4198 | 0.3065 | 0.4001 | 0.4004 | 18.9229 |
59
+ | 1.1809 | 3.0 | 717 | 0.9693 | 0.4215 | 0.3085 | 0.402 | 0.4024 | 18.9192 |
60
+ | 1.1809 | 4.0 | 956 | 0.9489 | 0.4208 | 0.3068 | 0.4016 | 0.402 | 18.9211 |
61
+ | 1.0899 | 5.0 | 1195 | 0.9402 | 0.4208 | 0.3074 | 0.4015 | 0.4019 | 18.9211 |
62
+ | 1.0899 | 6.0 | 1434 | 0.9204 | 0.4239 | 0.3125 | 0.4046 | 0.4048 | 18.9135 |
63
+ | 1.0455 | 7.0 | 1673 | 0.9111 | 0.4223 | 0.3094 | 0.4023 | 0.4024 | 18.9173 |
64
+ | 1.0455 | 8.0 | 1912 | 0.9055 | 0.4219 | 0.3106 | 0.4022 | 0.4024 | 18.9173 |
65
+ | 1.01 | 9.0 | 2151 | 0.8958 | 0.4218 | 0.3106 | 0.4016 | 0.4019 | 18.9154 |
66
+ | 1.01 | 10.0 | 2390 | 0.8901 | 0.4213 | 0.3106 | 0.4017 | 0.4022 | 18.9173 |
67
+ | 0.9841 | 11.0 | 2629 | 0.8828 | 0.4221 | 0.3117 | 0.4024 | 0.4029 | 18.9154 |
68
+ | 0.9841 | 12.0 | 2868 | 0.8749 | 0.4217 | 0.3102 | 0.4018 | 0.4021 | 18.9173 |
69
+ | 0.9599 | 13.0 | 3107 | 0.8755 | 0.4217 | 0.3104 | 0.4019 | 0.4023 | 18.9173 |
70
+ | 0.9599 | 14.0 | 3346 | 0.8733 | 0.4214 | 0.3103 | 0.4015 | 0.4016 | 18.9173 |
71
+ | 0.9487 | 15.0 | 3585 | 0.8701 | 0.4215 | 0.3097 | 0.4017 | 0.4019 | 18.9192 |
72
+ | 0.9487 | 16.0 | 3824 | 0.8663 | 0.4213 | 0.3099 | 0.4013 | 0.4016 | 18.9192 |
73
+ | 0.9396 | 17.0 | 4063 | 0.8647 | 0.4215 | 0.3092 | 0.4013 | 0.4015 | 18.9192 |
74
+ | 0.9396 | 18.0 | 4302 | 0.8621 | 0.4218 | 0.3098 | 0.4015 | 0.4018 | 18.9192 |
75
+ | 0.9329 | 19.0 | 4541 | 0.8615 | 0.422 | 0.3103 | 0.4017 | 0.4019 | 18.9192 |
76
+ | 0.9329 | 20.0 | 4780 | 0.8614 | 0.422 | 0.3103 | 0.4017 | 0.4019 | 18.9192 |
77
 
78
 
79
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:305a8fc9b850bd4dc40ca42d865bc91cab4fbeb3425503ccc5ee43ec58b0e034
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b04247777b5238e366ecdb73cc09559fa7f077b5492dfdee408440a121b46840
3
  size 242041896
runs/Mar15_02-36-45_45b5e1eda436/events.out.tfevents.1710470206.45b5e1eda436.573.2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7f35e1414fff6854937411edae953461be5e71f1342151b3fd5912f42b28de8d
3
- size 17478
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d944af09e22e942de9c0a59589046caee47440117bf394313d82bb53764bc3c
3
+ size 18357