End of training
Browse files
README.md
CHANGED
|
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 17 |
|
| 18 |
This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
|
| 19 |
It achieves the following results on the evaluation set:
|
| 20 |
-
- Loss: 0.
|
| 21 |
-
- Rouge1: 0.
|
| 22 |
-
- Rouge2: 0.
|
| 23 |
-
- Rougel: 0.
|
| 24 |
-
- Rougelsum: 0.
|
| 25 |
-
- Gen Len: 18.
|
| 26 |
|
| 27 |
## Model description
|
| 28 |
|
|
@@ -41,7 +41,7 @@ More information needed
|
|
| 41 |
### Training hyperparameters
|
| 42 |
|
| 43 |
The following hyperparameters were used during training:
|
| 44 |
-
- learning_rate:
|
| 45 |
- train_batch_size: 16
|
| 46 |
- eval_batch_size: 16
|
| 47 |
- seed: 42
|
|
@@ -54,26 +54,26 @@ The following hyperparameters were used during training:
|
|
| 54 |
|
| 55 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
| 56 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
|
| 57 |
-
| No log | 1.0 |
|
| 58 |
-
|
|
| 59 |
-
| 1.
|
| 60 |
-
| 1.
|
| 61 |
-
| 1.
|
| 62 |
-
| 1.
|
| 63 |
-
| 1.
|
| 64 |
-
| 1.
|
| 65 |
-
| 1.
|
| 66 |
-
| 1.
|
| 67 |
-
|
|
| 68 |
-
|
|
| 69 |
-
|
|
| 70 |
-
|
|
| 71 |
-
|
|
| 72 |
-
|
|
| 73 |
-
| 0.
|
| 74 |
-
| 0.
|
| 75 |
-
| 0.
|
| 76 |
-
| 0.
|
| 77 |
|
| 78 |
|
| 79 |
### Framework versions
|
|
|
|
| 17 |
|
| 18 |
This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
|
| 19 |
It achieves the following results on the evaluation set:
|
| 20 |
+
- Loss: 0.8614
|
| 21 |
+
- Rouge1: 0.422
|
| 22 |
+
- Rouge2: 0.3103
|
| 23 |
+
- Rougel: 0.4017
|
| 24 |
+
- Rougelsum: 0.4019
|
| 25 |
+
- Gen Len: 18.9192
|
| 26 |
|
| 27 |
## Model description
|
| 28 |
|
|
|
|
| 41 |
### Training hyperparameters
|
| 42 |
|
| 43 |
The following hyperparameters were used during training:
|
| 44 |
+
- learning_rate: 3.419313942464226e-05
|
| 45 |
- train_batch_size: 16
|
| 46 |
- eval_batch_size: 16
|
| 47 |
- seed: 42
|
|
|
|
| 54 |
|
| 55 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
| 56 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
|
| 57 |
+
| No log | 1.0 | 239 | 1.0311 | 0.418 | 0.304 | 0.3985 | 0.3988 | 18.9267 |
|
| 58 |
+
| No log | 2.0 | 478 | 1.0058 | 0.4198 | 0.3065 | 0.4001 | 0.4004 | 18.9229 |
|
| 59 |
+
| 1.1809 | 3.0 | 717 | 0.9693 | 0.4215 | 0.3085 | 0.402 | 0.4024 | 18.9192 |
|
| 60 |
+
| 1.1809 | 4.0 | 956 | 0.9489 | 0.4208 | 0.3068 | 0.4016 | 0.402 | 18.9211 |
|
| 61 |
+
| 1.0899 | 5.0 | 1195 | 0.9402 | 0.4208 | 0.3074 | 0.4015 | 0.4019 | 18.9211 |
|
| 62 |
+
| 1.0899 | 6.0 | 1434 | 0.9204 | 0.4239 | 0.3125 | 0.4046 | 0.4048 | 18.9135 |
|
| 63 |
+
| 1.0455 | 7.0 | 1673 | 0.9111 | 0.4223 | 0.3094 | 0.4023 | 0.4024 | 18.9173 |
|
| 64 |
+
| 1.0455 | 8.0 | 1912 | 0.9055 | 0.4219 | 0.3106 | 0.4022 | 0.4024 | 18.9173 |
|
| 65 |
+
| 1.01 | 9.0 | 2151 | 0.8958 | 0.4218 | 0.3106 | 0.4016 | 0.4019 | 18.9154 |
|
| 66 |
+
| 1.01 | 10.0 | 2390 | 0.8901 | 0.4213 | 0.3106 | 0.4017 | 0.4022 | 18.9173 |
|
| 67 |
+
| 0.9841 | 11.0 | 2629 | 0.8828 | 0.4221 | 0.3117 | 0.4024 | 0.4029 | 18.9154 |
|
| 68 |
+
| 0.9841 | 12.0 | 2868 | 0.8749 | 0.4217 | 0.3102 | 0.4018 | 0.4021 | 18.9173 |
|
| 69 |
+
| 0.9599 | 13.0 | 3107 | 0.8755 | 0.4217 | 0.3104 | 0.4019 | 0.4023 | 18.9173 |
|
| 70 |
+
| 0.9599 | 14.0 | 3346 | 0.8733 | 0.4214 | 0.3103 | 0.4015 | 0.4016 | 18.9173 |
|
| 71 |
+
| 0.9487 | 15.0 | 3585 | 0.8701 | 0.4215 | 0.3097 | 0.4017 | 0.4019 | 18.9192 |
|
| 72 |
+
| 0.9487 | 16.0 | 3824 | 0.8663 | 0.4213 | 0.3099 | 0.4013 | 0.4016 | 18.9192 |
|
| 73 |
+
| 0.9396 | 17.0 | 4063 | 0.8647 | 0.4215 | 0.3092 | 0.4013 | 0.4015 | 18.9192 |
|
| 74 |
+
| 0.9396 | 18.0 | 4302 | 0.8621 | 0.4218 | 0.3098 | 0.4015 | 0.4018 | 18.9192 |
|
| 75 |
+
| 0.9329 | 19.0 | 4541 | 0.8615 | 0.422 | 0.3103 | 0.4017 | 0.4019 | 18.9192 |
|
| 76 |
+
| 0.9329 | 20.0 | 4780 | 0.8614 | 0.422 | 0.3103 | 0.4017 | 0.4019 | 18.9192 |
|
| 77 |
|
| 78 |
|
| 79 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 242041896
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b04247777b5238e366ecdb73cc09559fa7f077b5492dfdee408440a121b46840
|
| 3 |
size 242041896
|
runs/Mar15_02-36-45_45b5e1eda436/events.out.tfevents.1710470206.45b5e1eda436.573.2
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2d944af09e22e942de9c0a59589046caee47440117bf394313d82bb53764bc3c
|
| 3 |
+
size 18357
|