Training complete
Browse files
README.md
CHANGED
|
@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
|
| 20 |
It achieves the following results on the evaluation set:
|
| 21 |
-
- Loss:
|
| 22 |
-
- Rouge1:
|
| 23 |
-
- Rouge2:
|
| 24 |
-
- Rougel:
|
| 25 |
-
- Rougelsum:
|
| 26 |
|
| 27 |
## Model description
|
| 28 |
|
|
@@ -53,14 +53,14 @@ The following hyperparameters were used during training:
|
|
| 53 |
|
| 54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
| 55 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
|
| 56 |
-
|
|
| 57 |
-
| 27.
|
| 58 |
-
| 26.
|
| 59 |
-
| 24.
|
| 60 |
-
|
|
| 61 |
-
|
|
| 62 |
-
| 23.
|
| 63 |
-
| 22.
|
| 64 |
|
| 65 |
|
| 66 |
### Framework versions
|
|
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
|
| 20 |
It achieves the following results on the evaluation set:
|
| 21 |
+
- Loss: 17.8785
|
| 22 |
+
- Rouge1: 3.9305
|
| 23 |
+
- Rouge2: 0.4293
|
| 24 |
+
- Rougel: 3.82
|
| 25 |
+
- Rougelsum: 3.8037
|
| 26 |
|
| 27 |
## Model description
|
| 28 |
|
|
|
|
| 53 |
|
| 54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
| 55 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
|
| 56 |
+
| 32.4777 | 1.0 | 10 | 25.3500 | 4.3802 | 0.6591 | 4.2675 | 4.2475 |
|
| 57 |
+
| 27.9251 | 2.0 | 20 | 21.6511 | 4.6739 | 1.2021 | 4.6067 | 4.5692 |
|
| 58 |
+
| 26.5628 | 3.0 | 30 | 20.5428 | 4.6916 | 1.1508 | 4.5642 | 4.6407 |
|
| 59 |
+
| 24.505 | 4.0 | 40 | 18.7937 | 4.1773 | 0.3385 | 4.0182 | 4.0615 |
|
| 60 |
+
| 23.1436 | 5.0 | 50 | 18.3412 | 4.3078 | 0.3432 | 4.2093 | 4.2728 |
|
| 61 |
+
| 22.6089 | 6.0 | 60 | 18.5093 | 3.5525 | 0.3453 | 3.484 | 3.4419 |
|
| 62 |
+
| 23.8132 | 7.0 | 70 | 17.9128 | 3.8649 | 0.342 | 3.7392 | 3.7184 |
|
| 63 |
+
| 22.3071 | 8.0 | 80 | 17.8785 | 3.9253 | 0.4131 | 3.7871 | 3.7766 |
|
| 64 |
|
| 65 |
|
| 66 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1200729512
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1718517a1855a4b99c9b7352afc35380bbea2807296ba2a7bb5f12ddb11b7724
|
| 3 |
size 1200729512
|
runs/Apr23_16-21-05_c4ea92c1ff40/events.out.tfevents.1713889287.c4ea92c1ff40.57208.0
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0c792425589a0da2028e865f3cbc1efa28ebb0efe84e2d2b0332d3ae7fcfe2a5
|
| 3 |
+
size 10676
|
runs/Apr23_16-21-05_c4ea92c1ff40/events.out.tfevents.1713889339.c4ea92c1ff40.57208.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cf342d8d512935b62a062b75b2e4f7e3af87f6b93bac2c9422d200eb5b39f1a4
|
| 3 |
+
size 553
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5112
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b3789af83b0694159772d4dbf18e6f5938bdcbbffb3b9bd2cf2d8b811d92b6b6
|
| 3 |
size 5112
|