Training complete

Files changed (5) hide show

README.md CHANGED Viewed

@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 18.5420
-- Rouge1: 5.4719
-- Rouge2: 1.348
-- Rougel: 4.9456
-- Rougelsum: 4.9853
 ## Model description
@@ -53,14 +53,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
-| 31.6413       | 1.0   | 10   | 24.0170         | 3.8323 | 0.2047 | 3.7904 | 3.8284    |
-| 27.8266       | 2.0   | 20   | 23.3707         | 3.1294 | 0.2047 | 3.1291 | 3.1228    |
-| 26.1653       | 3.0   | 30   | 22.3135         | 3.4676 | 0.3738 | 3.4903 | 3.4927    |
-| 24.5148       | 4.0   | 40   | 21.0853         | 3.9753 | 0.7238 | 3.8535 | 3.7906    |
-| 24.3813       | 5.0   | 50   | 20.0264         | 4.4993 | 0.9797 | 4.2104 | 4.1768    |
-| 23.2283       | 6.0   | 60   | 19.1784         | 5.1955 | 1.1276 | 4.8347 | 4.8356    |
-| 23.186        | 7.0   | 70   | 18.7094         | 5.3472 | 1.2564 | 4.787  | 4.8648    |
-| 22.0852       | 8.0   | 80   | 18.5420         | 5.5064 | 1.3106 | 4.907  | 4.9318    |
 ### Framework versions

 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 17.8785
+- Rouge1: 3.9305
+- Rouge2: 0.4293
+- Rougel: 3.82
+- Rougelsum: 3.8037
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
+| 32.4777       | 1.0   | 10   | 25.3500         | 4.3802 | 0.6591 | 4.2675 | 4.2475    |
+| 27.9251       | 2.0   | 20   | 21.6511         | 4.6739 | 1.2021 | 4.6067 | 4.5692    |
+| 26.5628       | 3.0   | 30   | 20.5428         | 4.6916 | 1.1508 | 4.5642 | 4.6407    |
+| 24.505        | 4.0   | 40   | 18.7937         | 4.1773 | 0.3385 | 4.0182 | 4.0615    |
+| 23.1436       | 5.0   | 50   | 18.3412         | 4.3078 | 0.3432 | 4.2093 | 4.2728    |
+| 22.6089       | 6.0   | 60   | 18.5093         | 3.5525 | 0.3453 | 3.484  | 3.4419    |
+| 23.8132       | 7.0   | 70   | 17.9128         | 3.8649 | 0.342  | 3.7392 | 3.7184    |
+| 22.3071       | 8.0   | 80   | 17.8785         | 3.9253 | 0.4131 | 3.7871 | 3.7766    |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a2c5f84e63dae5282a3b6c465fcf37e0afc04d79223dc83ceac8206757d25c2d
 size 1200729512

 version https://git-lfs.github.com/spec/v1
+oid sha256:1718517a1855a4b99c9b7352afc35380bbea2807296ba2a7bb5f12ddb11b7724
 size 1200729512

runs/Apr23_16-21-05_c4ea92c1ff40/events.out.tfevents.1713889287.c4ea92c1ff40.57208.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0c792425589a0da2028e865f3cbc1efa28ebb0efe84e2d2b0332d3ae7fcfe2a5
+size 10676

runs/Apr23_16-21-05_c4ea92c1ff40/events.out.tfevents.1713889339.c4ea92c1ff40.57208.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:cf342d8d512935b62a062b75b2e4f7e3af87f6b93bac2c9422d200eb5b39f1a4
+size 553

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5c7b0d0d7b8b1d4795cbdea25e42304ffe58cdc5701756d0944d5c8894d58d8c
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:b3789af83b0694159772d4dbf18e6f5938bdcbbffb3b9bd2cf2d8b811d92b6b6
 size 5112