Miranda/t5-small-train

@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.4110
-- Rouge1: 41.006
-- Rouge2: 18.9406
-- Rougel: 35.7319
-- Rougelsum: 35.9987
+- Loss: 2.3623
+- Rouge1: 40.5101
+- Rouge2: 19.0112
+- Rougel: 35.5748
+- Rougelsum: 35.9291
 ## Model description
@@ -46,27 +46,25 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
+- num_epochs: 8
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
-| 2.5097        | 1.0   | 45   | 2.5765          | 36.9095 | 15.7531 | 32.4588 | 32.7501   |
-| 2.39          | 2.0   | 90   | 2.4823          | 39.1984 | 17.2602 | 34.5018 | 34.8303   |
-| 2.2862        | 3.0   | 135  | 2.4521          | 39.9179 | 18.2643 | 35.4775 | 35.7854   |
-| 2.2011        | 4.0   | 180  | 2.4314          | 40.1014 | 18.3646 | 35.274  | 35.5883   |
-| 2.1335        | 5.0   | 225  | 2.4240          | 40.1053 | 18.406  | 35.0905 | 35.3427   |
-| 2.0803        | 6.0   | 270  | 2.4178          | 41.1202 | 18.5746 | 35.5454 | 35.7857   |
-| 2.0662        | 7.0   | 315  | 2.4129          | 40.7965 | 18.5148 | 35.5866 | 35.8591   |
-| 2.0291        | 8.0   | 360  | 2.4103          | 40.7121 | 18.8736 | 35.6646 | 35.9392   |
-| 1.9807        | 9.0   | 405  | 2.4112          | 40.9464 | 18.9815 | 35.8468 | 36.1114   |
-| 1.9702        | 10.0  | 450  | 2.4110          | 41.006  | 18.9406 | 35.7319 | 35.9987   |
+| 3.1374        | 1.0   | 45   | 2.6906          | 34.9598 | 15.4159 | 30.7378 | 30.9607   |
+| 2.598         | 2.0   | 90   | 2.5073          | 38.2818 | 16.4572 | 34.168  | 34.1708   |
+| 2.4287        | 3.0   | 135  | 2.4314          | 40.0863 | 18.3821 | 35.1633 | 35.441    |
+| 2.3109        | 4.0   | 180  | 2.3939          | 40.3133 | 18.9829 | 35.6333 | 35.8475   |
+| 2.2234        | 5.0   | 225  | 2.3762          | 40.405  | 18.7467 | 35.7971 | 36.035    |
+| 2.2274        | 6.0   | 270  | 2.3686          | 40.507  | 18.8308 | 35.5185 | 35.8219   |
+| 2.1655        | 7.0   | 315  | 2.3644          | 40.468  | 19.0659 | 35.6811 | 35.9991   |
+| 2.1741        | 8.0   | 360  | 2.3623          | 40.5101 | 19.0112 | 35.5748 | 35.9291   |
 ### Framework versions
 - Transformers 4.18.0
-- Pytorch 1.10.0+cu111
+- Pytorch 1.11.0+cu113
 - Datasets 2.1.0
 - Tokenizers 0.12.1
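
One way to read this change is to compare the final evaluation metrics of the two versions directly. The following is a minimal sketch in Python, with the values copied from the removed (`-`) and added (`+`) lines of the card; the names `old`, `new`, and `metric_deltas` are illustrative and not part of the model card or of any library.

```python
# Final evaluation metrics, copied from the two versions of the model card.
# `old` is the removed (10-epoch) version, `new` is the added (8-epoch) version.
old = {"Loss": 2.4110, "Rouge1": 41.006, "Rouge2": 18.9406,
       "Rougel": 35.7319, "Rougelsum": 35.9987}
new = {"Loss": 2.3623, "Rouge1": 40.5101, "Rouge2": 19.0112,
       "Rougel": 35.5748, "Rougelsum": 35.9291}


def metric_deltas(before, after):
    """Return after - before for every metric, rounded to 4 decimals."""
    return {name: round(after[name] - before[name], 4) for name in before}


deltas = metric_deltas(old, new)
for name, delta in deltas.items():
    # Explicit sign makes improvements and regressions easy to spot.
    print(f"{name}: {old[name]} -> {new[name]} ({delta:+.4f})")
```

Under these numbers, validation loss and Rouge2 improve slightly in the new version, while Rouge1, Rougel, and Rougelsum are marginally lower. The card does not say whether anything other than `num_epochs` and the PyTorch version differed between the two runs.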