Model save
Browse files- README.md +17 -10
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -15,12 +15,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 15 |
|
| 16 |
This model was trained from scratch on an unknown dataset.
|
| 17 |
It achieves the following results on the evaluation set:
|
| 18 |
-
- Loss: 0.
|
| 19 |
-
- Rouge1:
|
| 20 |
-
- Rouge2:
|
| 21 |
-
- Rougel:
|
| 22 |
-
- Rougelsum:
|
| 23 |
-
- Gen Len: 16.
|
| 24 |
|
| 25 |
## Model description
|
| 26 |
|
|
@@ -45,15 +45,22 @@ The following hyperparameters were used during training:
|
|
| 45 |
- seed: 42
|
| 46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 47 |
- lr_scheduler_type: linear
|
| 48 |
-
- num_epochs:
|
| 49 |
|
| 50 |
### Training results
|
| 51 |
|
| 52 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
| 53 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
|
| 54 |
-
| No log | 1.0 | 76 | 0.
|
| 55 |
-
| No log | 2.0 | 152 | 0.
|
| 56 |
-
| No log | 3.0 | 228 | 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 57 |
|
| 58 |
|
| 59 |
### Framework versions
|
|
|
|
| 15 |
|
| 16 |
This model was trained from scratch on an unknown dataset.
|
| 17 |
It achieves the following results on the evaluation set:
|
| 18 |
+
- Loss: 0.6757
|
| 19 |
+
- Rouge1: 73.1568
|
| 20 |
+
- Rouge2: 56.3431
|
| 21 |
+
- Rougel: 73.2739
|
| 22 |
+
- Rougelsum: 73.2387
|
| 23 |
+
- Gen Len: 16.5471
|
| 24 |
|
| 25 |
## Model description
|
| 26 |
|
|
|
|
| 45 |
- seed: 42
|
| 46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 47 |
- lr_scheduler_type: linear
|
| 48 |
+
- num_epochs: 10
|
| 49 |
|
| 50 |
### Training results
|
| 51 |
|
| 52 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
| 53 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
|
| 54 |
+
| No log | 1.0 | 76 | 0.7087 | 72.4084 | 55.2193 | 72.4899 | 72.4272 | 16.5916 |
|
| 55 |
+
| No log | 2.0 | 152 | 0.6998 | 72.7389 | 55.4449 | 72.7601 | 72.7258 | 16.5497 |
|
| 56 |
+
| No log | 3.0 | 228 | 0.6946 | 72.674 | 55.5275 | 72.7467 | 72.712 | 16.5288 |
|
| 57 |
+
| No log | 4.0 | 304 | 0.6888 | 72.7071 | 55.7658 | 72.7673 | 72.7402 | 16.5524 |
|
| 58 |
+
| No log | 5.0 | 380 | 0.6829 | 72.8829 | 55.8072 | 72.9415 | 72.9187 | 16.5602 |
|
| 59 |
+
| No log | 6.0 | 456 | 0.6801 | 73.067 | 55.9923 | 73.137 | 73.1117 | 16.5681 |
|
| 60 |
+
| 0.8082 | 7.0 | 532 | 0.6791 | 73.1192 | 56.0297 | 73.2107 | 73.1619 | 16.5707 |
|
| 61 |
+
| 0.8082 | 8.0 | 608 | 0.6768 | 73.0697 | 56.0297 | 73.1433 | 73.1279 | 16.5785 |
|
| 62 |
+
| 0.8082 | 9.0 | 684 | 0.6763 | 72.9717 | 55.9654 | 73.0873 | 73.0365 | 16.5576 |
|
| 63 |
+
| 0.8082 | 10.0 | 760 | 0.6757 | 73.1568 | 56.3431 | 73.2739 | 73.2387 | 16.5471 |
|
| 64 |
|
| 65 |
|
| 66 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 242041896
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dcf6bb7f5be98a0b03f9f42ba56e808363fc3041fdd81940521b972d879d0afc
|
| 3 |
size 242041896
|