Commit aa7ebc4 by thabat · verified · 1 Parent(s): 37c7e00

Training complete

README.md CHANGED
@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 18.5420
- - Rouge1: 5.4719
- - Rouge2: 1.348
- - Rougel: 4.9456
- - Rougelsum: 4.9853
+ - Loss: 17.8785
+ - Rouge1: 3.9305
+ - Rouge2: 0.4293
+ - Rougel: 3.82
+ - Rougelsum: 3.8037
 
  ## Model description
 
@@ -53,14 +53,14 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
- | 31.6413 | 1.0 | 10 | 24.0170 | 3.8323 | 0.2047 | 3.7904 | 3.8284 |
- | 27.8266 | 2.0 | 20 | 23.3707 | 3.1294 | 0.2047 | 3.1291 | 3.1228 |
- | 26.1653 | 3.0 | 30 | 22.3135 | 3.4676 | 0.3738 | 3.4903 | 3.4927 |
- | 24.5148 | 4.0 | 40 | 21.0853 | 3.9753 | 0.7238 | 3.8535 | 3.7906 |
- | 24.3813 | 5.0 | 50 | 20.0264 | 4.4993 | 0.9797 | 4.2104 | 4.1768 |
- | 23.2283 | 6.0 | 60 | 19.1784 | 5.1955 | 1.1276 | 4.8347 | 4.8356 |
- | 23.186 | 7.0 | 70 | 18.7094 | 5.3472 | 1.2564 | 4.787 | 4.8648 |
- | 22.0852 | 8.0 | 80 | 18.5420 | 5.5064 | 1.3106 | 4.907 | 4.9318 |
+ | 32.4777 | 1.0 | 10 | 25.3500 | 4.3802 | 0.6591 | 4.2675 | 4.2475 |
+ | 27.9251 | 2.0 | 20 | 21.6511 | 4.6739 | 1.2021 | 4.6067 | 4.5692 |
+ | 26.5628 | 3.0 | 30 | 20.5428 | 4.6916 | 1.1508 | 4.5642 | 4.6407 |
+ | 24.505 | 4.0 | 40 | 18.7937 | 4.1773 | 0.3385 | 4.0182 | 4.0615 |
+ | 23.1436 | 5.0 | 50 | 18.3412 | 4.3078 | 0.3432 | 4.2093 | 4.2728 |
+ | 22.6089 | 6.0 | 60 | 18.5093 | 3.5525 | 0.3453 | 3.484 | 3.4419 |
+ | 23.8132 | 7.0 | 70 | 17.9128 | 3.8649 | 0.342 | 3.7392 | 3.7184 |
+ | 22.3071 | 8.0 | 80 | 17.8785 | 3.9253 | 0.4131 | 3.7871 | 3.7766 |
 
 
  ### Framework versions
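The two results tables in the README.md diff can be compared programmatically. The following is a minimal sketch, not part of the commit: it copies the table rows from the diff verbatim and reports, for each run, the epoch with the lowest validation loss and the epoch with the highest Rouge1. The helper names `parse_row` and `best_epoch` are hypothetical and introduced only for illustration.

```python
# Table rows copied verbatim from the README.md diff (removed = old run, added = new run).
OLD_ROWS = """
| 31.6413 | 1.0 | 10 | 24.0170 | 3.8323 | 0.2047 | 3.7904 | 3.8284 |
| 27.8266 | 2.0 | 20 | 23.3707 | 3.1294 | 0.2047 | 3.1291 | 3.1228 |
| 26.1653 | 3.0 | 30 | 22.3135 | 3.4676 | 0.3738 | 3.4903 | 3.4927 |
| 24.5148 | 4.0 | 40 | 21.0853 | 3.9753 | 0.7238 | 3.8535 | 3.7906 |
| 24.3813 | 5.0 | 50 | 20.0264 | 4.4993 | 0.9797 | 4.2104 | 4.1768 |
| 23.2283 | 6.0 | 60 | 19.1784 | 5.1955 | 1.1276 | 4.8347 | 4.8356 |
| 23.186 | 7.0 | 70 | 18.7094 | 5.3472 | 1.2564 | 4.787 | 4.8648 |
| 22.0852 | 8.0 | 80 | 18.5420 | 5.5064 | 1.3106 | 4.907 | 4.9318 |
"""
NEW_ROWS = """
| 32.4777 | 1.0 | 10 | 25.3500 | 4.3802 | 0.6591 | 4.2675 | 4.2475 |
| 27.9251 | 2.0 | 20 | 21.6511 | 4.6739 | 1.2021 | 4.6067 | 4.5692 |
| 26.5628 | 3.0 | 30 | 20.5428 | 4.6916 | 1.1508 | 4.5642 | 4.6407 |
| 24.505 | 4.0 | 40 | 18.7937 | 4.1773 | 0.3385 | 4.0182 | 4.0615 |
| 23.1436 | 5.0 | 50 | 18.3412 | 4.3078 | 0.3432 | 4.2093 | 4.2728 |
| 22.6089 | 6.0 | 60 | 18.5093 | 3.5525 | 0.3453 | 3.484 | 3.4419 |
| 23.8132 | 7.0 | 70 | 17.9128 | 3.8649 | 0.342 | 3.7392 | 3.7184 |
| 22.3071 | 8.0 | 80 | 17.8785 | 3.9253 | 0.4131 | 3.7871 | 3.7766 |
"""

# Column order follows the table header in the diff.
COLUMNS = ["train_loss", "epoch", "step", "val_loss",
           "rouge1", "rouge2", "rougel", "rougelsum"]


def parse_row(line):
    """Turn one markdown table row into a dict keyed by column name."""
    cells = line.strip().strip("|").split("|")
    return dict(zip(COLUMNS, (float(cell) for cell in cells)))


def parse_table(text):
    """Parse every non-empty line of a block of markdown table rows."""
    return [parse_row(line) for line in text.strip().splitlines()]


def best_epoch(rows, column, higher_is_better):
    """Return the epoch whose value in `column` is best."""
    pick = max if higher_is_better else min
    return int(pick(rows, key=lambda row: row[column])["epoch"])


old_run = parse_table(OLD_ROWS)
new_run = parse_table(NEW_ROWS)

for name, rows in (("old", old_run), ("new", new_run)):
    print(name,
          "| lowest validation loss at epoch", best_epoch(rows, "val_loss", False),
          "| highest Rouge1 at epoch", best_epoch(rows, "rouge1", True))
```

On these rows, both runs reach their lowest validation loss at epoch 8, but the new run's Rouge1 peaks at epoch 3 and is lower by the final epoch, whereas the old run's Rouge1 peaks at epoch 8.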
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a2c5f84e63dae5282a3b6c465fcf37e0afc04d79223dc83ceac8206757d25c2d
+ oid sha256:1718517a1855a4b99c9b7352afc35380bbea2807296ba2a7bb5f12ddb11b7724
  size 1200729512
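The model.safetensors entry is a Git LFS pointer: the `oid` is the SHA-256 digest of the real file and `size` is its length in bytes. The sketch below shows one way a downloaded copy could be checked against the new pointer. The function names `parse_lfs_pointer` and `matches_pointer` and the local path in the usage comment are assumptions for illustration, not part of this commit or of any library.

```python
import hashlib
import os

# New pointer for model.safetensors, copied from the diff.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:1718517a1855a4b99c9b7352afc35380bbea2807296ba2a7bb5f12ddb11b7724
size 1200729512
"""


def parse_lfs_pointer(text):
    """Return (hash algorithm, hex digest, size in bytes) from a pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algorithm, digest = fields["oid"].split(":", 1)
    return algorithm, digest, int(fields["size"])


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Check a local file's size and SHA-256 digest against an LFS pointer."""
    algorithm, digest, size = parse_lfs_pointer(pointer_text)
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm}")
    # Cheap check first: a size mismatch means hashing is unnecessary.
    if os.path.getsize(path) != size:
        return False
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so a file of over 1 GB is not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == digest


# Usage (hypothetical local path):
# print(matches_pointer("model.safetensors", POINTER))
```

The old and new pointers report the same size (1200729512 bytes), so only the digest distinguishes the two sets of weights.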
runs/Apr23_16-21-05_c4ea92c1ff40/events.out.tfevents.1713889287.c4ea92c1ff40.57208.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0c792425589a0da2028e865f3cbc1efa28ebb0efe84e2d2b0332d3ae7fcfe2a5
+ size 10676
runs/Apr23_16-21-05_c4ea92c1ff40/events.out.tfevents.1713889339.c4ea92c1ff40.57208.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:cf342d8d512935b62a062b75b2e4f7e3af87f6b93bac2c9422d200eb5b39f1a4
+ size 553
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5c7b0d0d7b8b1d4795cbdea25e42304ffe58cdc5701756d0944d5c8894d58d8c
+ oid sha256:b3789af83b0694159772d4dbf18e6f5938bdcbbffb3b9bd2cf2d8b811d92b6b6
  size 5112