Training complete
README.md
CHANGED
@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 18.
-- Rouge1:
-- Rouge2:
-- Rougel:
-- Rougelsum:
+- Loss: 18.5420
+- Rouge1: 5.4719
+- Rouge2: 1.348
+- Rougel: 4.9456
+- Rougelsum: 4.9853
 
 ## Model description
 
@@ -53,14 +53,14 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
-| 31.
-|
-| 26.
-| 24.
-|
-|
-|
-| 22.
+| 31.6413 | 1.0 | 10 | 24.0170 | 3.8323 | 0.2047 | 3.7904 | 3.8284 |
+| 27.8266 | 2.0 | 20 | 23.3707 | 3.1294 | 0.2047 | 3.1291 | 3.1228 |
+| 26.1653 | 3.0 | 30 | 22.3135 | 3.4676 | 0.3738 | 3.4903 | 3.4927 |
+| 24.5148 | 4.0 | 40 | 21.0853 | 3.9753 | 0.7238 | 3.8535 | 3.7906 |
+| 24.3813 | 5.0 | 50 | 20.0264 | 4.4993 | 0.9797 | 4.2104 | 4.1768 |
+| 23.2283 | 6.0 | 60 | 19.1784 | 5.1955 | 1.1276 | 4.8347 | 4.8356 |
+| 23.186 | 7.0 | 70 | 18.7094 | 5.3472 | 1.2564 | 4.787 | 4.8648 |
+| 22.0852 | 8.0 | 80 | 18.5420 | 5.5064 | 1.3106 | 4.907 | 4.9318 |
 
 
 ### Framework versions
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:a2c5f84e63dae5282a3b6c465fcf37e0afc04d79223dc83ceac8206757d25c2d
 size 1200729512
runs/Apr23_16-04-44_c4ea92c1ff40/events.out.tfevents.1713888319.c4ea92c1ff40.52352.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:03c9cd766da8b79701bbc940d3718092c2af2660a6e7b23bee9271cf67934c1e
+size 10676
runs/Apr23_16-04-44_c4ea92c1ff40/events.out.tfevents.1713888363.c4ea92c1ff40.52352.1
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3011cc8f23f6df26e89263875f055beb6a6103f4e3f8326804b317988766bd2f
+size 553
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:5c7b0d0d7b8b1d4795cbdea25e42304ffe58cdc5701756d0944d5c8894d58d8c
 size 5112
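
The binary files in this commit are stored as Git LFS pointers: three lines giving the spec version, a SHA-256 object id, and the size in bytes. As a minimal sketch of how a downloaded file could be checked against such a pointer, the Python below parses the pointer text and compares size and digest. The names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical helpers introduced here for illustration and are not part of this repository or of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and SHA-256 digest match the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash: {pointer['algo']}")
    # Cheap check first: a size mismatch rules the file out without hashing.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as fh:
        # Stream in chunks so large files need not fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text copied from the second TensorBoard event file added in this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3011cc8f23f6df26e89263875f055beb6a6103f4e3f8326804b317988766bd2f
size 553
"""
pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Assuming the event file has been downloaded locally, `file_matches_pointer(Path(...), pointer)` would report whether it matches the pointer; the size comparison runs first so that mismatched files are rejected without reading their contents.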