End of training
Browse files
README.md
CHANGED
|
@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
|
| 20 |
It achieves the following results on the evaluation set:
|
| 21 |
-
- Loss:
|
| 22 |
-
- Rouge1:
|
| 23 |
-
- Rouge2:
|
| 24 |
-
- Rougel:
|
| 25 |
-
- Rougelsum:
|
| 26 |
|
| 27 |
## Model description
|
| 28 |
|
|
@@ -53,16 +53,16 @@ The following hyperparameters were used during training:
|
|
| 53 |
|
| 54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
| 55 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
|
| 56 |
-
|
|
| 57 |
-
|
|
| 58 |
-
|
|
| 59 |
-
|
|
| 60 |
-
|
|
| 61 |
-
|
|
| 62 |
-
|
|
| 63 |
-
|
|
| 64 |
-
|
|
| 65 |
-
|
|
| 66 |
|
| 67 |
|
| 68 |
### Framework versions
|
|
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
|
| 20 |
It achieves the following results on the evaluation set:
|
| 21 |
+
- Loss: 0.0014
|
| 22 |
+
- Rouge1: 99.9541
|
| 23 |
+
- Rouge2: 87.8299
|
| 24 |
+
- Rougel: 99.9541
|
| 25 |
+
- Rougelsum: 99.9541
|
| 26 |
|
| 27 |
## Model description
|
| 28 |
|
|
|
|
| 53 |
|
| 54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
| 55 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
|
| 56 |
+
| 4.0259 | 1.0 | 376 | 2.0056 | 39.5517 | 23.9488 | 39.1195 | 39.0817 |
|
| 57 |
+
| 1.4477 | 2.0 | 752 | 0.6119 | 77.1176 | 63.7739 | 77.6351 | 77.12 |
|
| 58 |
+
| 0.7437 | 3.0 | 1128 | 0.0754 | 95.5267 | 84.31 | 95.5087 | 95.4086 |
|
| 59 |
+
| 0.3729 | 4.0 | 1504 | 0.0207 | 99.2424 | 86.3148 | 99.1965 | 99.1965 |
|
| 60 |
+
| 0.3327 | 5.0 | 1880 | 0.0119 | 99.2424 | 86.3148 | 99.1965 | 99.1965 |
|
| 61 |
+
| 0.2315 | 6.0 | 2256 | 0.0060 | 99.2424 | 86.3148 | 99.1965 | 99.1965 |
|
| 62 |
+
| 0.2843 | 7.0 | 2632 | 0.0029 | 99.9541 | 87.8299 | 99.9541 | 99.9541 |
|
| 63 |
+
| 0.1362 | 8.0 | 3008 | 0.0024 | 99.9541 | 87.8299 | 99.9541 | 99.9541 |
|
| 64 |
+
| 0.1723 | 9.0 | 3384 | 0.0016 | 99.9541 | 87.8299 | 99.9541 | 99.9541 |
|
| 65 |
+
| 0.1149 | 10.0 | 3760 | 0.0014 | 99.9541 | 87.8299 | 99.9541 | 99.9541 |
|
| 66 |
|
| 67 |
|
| 68 |
### Framework versions
|