Commit ·
6b4cf51
1
Parent(s): 8a33719
Model save
Browse files
README.md
CHANGED
|
@@ -1,6 +1,4 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
base_model: google/flan-T5-small
|
| 4 |
tags:
|
| 5 |
- generated_from_trainer
|
| 6 |
metrics:
|
|
@@ -15,14 +13,14 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 15 |
|
| 16 |
# flanT5_small_title_desc_gen4
|
| 17 |
|
| 18 |
-
This model
|
| 19 |
It achieves the following results on the evaluation set:
|
| 20 |
-
- Loss:
|
| 21 |
-
- Rouge1:
|
| 22 |
-
- Rouge2:
|
| 23 |
-
- Rougel:
|
| 24 |
-
- Rougelsum:
|
| 25 |
-
- Gen Len:
|
| 26 |
|
| 27 |
## Model description
|
| 28 |
|
|
@@ -53,16 +51,16 @@ The following hyperparameters were used during training:
|
|
| 53 |
|
| 54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
| 55 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
|
| 56 |
-
| No log | 1.0 | 108 |
|
| 57 |
-
| No log | 2.0 | 216 |
|
| 58 |
-
| No log | 3.0 | 324 |
|
| 59 |
-
| No log | 4.0 | 432 |
|
| 60 |
-
|
|
| 61 |
-
|
|
| 62 |
-
|
|
| 63 |
-
|
|
| 64 |
-
|
|
| 65 |
-
|
|
| 66 |
|
| 67 |
|
| 68 |
### Framework versions
|
|
|
|
| 1 |
---
|
|
|
|
|
|
|
| 2 |
tags:
|
| 3 |
- generated_from_trainer
|
| 4 |
metrics:
|
|
|
|
| 13 |
|
| 14 |
# flanT5_small_title_desc_gen4
|
| 15 |
|
| 16 |
+
This model was trained from scratch on the None dataset.
|
| 17 |
It achieves the following results on the evaluation set:
|
| 18 |
+
- Loss: 0.6014
|
| 19 |
+
- Rouge1: 2.7656
|
| 20 |
+
- Rouge2: 1.2373
|
| 21 |
+
- Rougel: 2.27
|
| 22 |
+
- Rougelsum: 2.3229
|
| 23 |
+
- Gen Len: 17.1667
|
| 24 |
|
| 25 |
## Model description
|
| 26 |
|
|
|
|
| 51 |
|
| 52 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
| 53 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
|
| 54 |
+
| No log | 1.0 | 108 | 0.9428 | 4.2395 | 2.8932 | 3.752 | 3.8235 | 12.1979 |
|
| 55 |
+
| No log | 2.0 | 216 | 0.8406 | 3.2873 | 1.8803 | 2.7746 | 2.8684 | 14.9479 |
|
| 56 |
+
| No log | 3.0 | 324 | 0.7626 | 3.0511 | 1.5565 | 2.535 | 2.6178 | 16.0625 |
|
| 57 |
+
| No log | 4.0 | 432 | 0.7117 | 2.7859 | 1.3242 | 2.2739 | 2.3331 | 16.6979 |
|
| 58 |
+
| 1.1608 | 5.0 | 540 | 0.6728 | 2.493 | 0.9561 | 1.9535 | 2.0105 | 17.625 |
|
| 59 |
+
| 1.1608 | 6.0 | 648 | 0.6452 | 2.5539 | 1.0342 | 2.0547 | 2.1008 | 17.5625 |
|
| 60 |
+
| 1.1608 | 7.0 | 756 | 0.6238 | 2.8349 | 1.2996 | 2.3204 | 2.3663 | 16.9688 |
|
| 61 |
+
| 1.1608 | 8.0 | 864 | 0.6113 | 2.8097 | 1.2806 | 2.3124 | 2.3623 | 17.0833 |
|
| 62 |
+
| 1.1608 | 9.0 | 972 | 0.6043 | 2.7542 | 1.2338 | 2.2593 | 2.3073 | 17.1979 |
|
| 63 |
+
| 0.8599 | 10.0 | 1080 | 0.6014 | 2.7656 | 1.2373 | 2.27 | 2.3229 | 17.1667 |
|
| 64 |
|
| 65 |
|
| 66 |
### Framework versions
|
logs/events.out.tfevents.1698957045.2138c942b38f.713.0
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:10d7a2ad09e6daf6e83624731fe1f9afa81373eff2d7690ae20342b95b797323
|
| 3 |
+
size 11240
|
logs/events.out.tfevents.1698958025.2138c942b38f.713.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:909241c84bbcea1b450e3803d7d336fc61406c8acf259c7f9860476b03119500
|
| 3 |
+
size 613
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 307867048
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c3734c1052023fcb89d6b1ca0fdf8fef19ec1f22fe530f0d8cd90d0cdff912eb
|
| 3 |
size 307867048
|