diallomama
/

ff-en

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0208
 ## Model description
@@ -35,7 +35,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0003
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -47,56 +47,56 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.2934        | 1.0   | 20   | 1.0480          |
-| 1.1576        | 2.0   | 40   | 0.9532          |
-| 1.0316        | 3.0   | 60   | 0.8803          |
-| 0.9428        | 4.0   | 80   | 0.8531          |
-| 0.8739        | 5.0   | 100  | 0.8284          |
-| 0.8312        | 6.0   | 120  | 0.8240          |
-| 0.7682        | 7.0   | 140  | 0.8247          |
-| 0.7325        | 8.0   | 160  | 0.8245          |
-| 0.7102        | 9.0   | 180  | 0.8220          |
-| 0.6386        | 10.0  | 200  | 0.8228          |
-| 0.6317        | 11.0  | 220  | 0.8307          |
-| 0.5935        | 12.0  | 240  | 0.8297          |
-| 0.5636        | 13.0  | 260  | 0.8402          |
-| 0.5445        | 14.0  | 280  | 0.8468          |
-| 0.5208        | 15.0  | 300  | 0.8589          |
-| 0.4867        | 16.0  | 320  | 0.8629          |
-| 0.4706        | 17.0  | 340  | 0.8675          |
-| 0.4429        | 18.0  | 360  | 0.8722          |
-| 0.4201        | 19.0  | 380  | 0.8882          |
-| 0.4081        | 20.0  | 400  | 0.8949          |
-| 0.3923        | 21.0  | 420  | 0.9109          |
-| 0.3771        | 22.0  | 440  | 0.9141          |
-| 0.3734        | 23.0  | 460  | 0.9245          |
-| 0.3436        | 24.0  | 480  | 0.9314          |
-| 0.341         | 25.0  | 500  | 0.9347          |
-| 0.3193        | 26.0  | 520  | 0.9462          |
-| 0.2991        | 27.0  | 540  | 0.9538          |
-| 0.2994        | 28.0  | 560  | 0.9539          |
-| 0.2991        | 29.0  | 580  | 0.9703          |
-| 0.2922        | 30.0  | 600  | 0.9625          |
-| 0.2726        | 31.0  | 620  | 0.9682          |
-| 0.2641        | 32.0  | 640  | 0.9722          |
-| 0.2514        | 33.0  | 660  | 0.9779          |
-| 0.245         | 34.0  | 680  | 0.9853          |
-| 0.2578        | 35.0  | 700  | 0.9875          |
-| 0.2443        | 36.0  | 720  | 0.9915          |
-| 0.2389        | 37.0  | 740  | 0.9948          |
-| 0.2317        | 38.0  | 760  | 0.9973          |
-| 0.2236        | 39.0  | 780  | 0.9984          |
-| 0.2128        | 40.0  | 800  | 1.0058          |
-| 0.219         | 41.0  | 820  | 1.0122          |
-| 0.215         | 42.0  | 840  | 1.0137          |
-| 0.2076        | 43.0  | 860  | 1.0173          |
-| 0.2098        | 44.0  | 880  | 1.0147          |
-| 0.1976        | 45.0  | 900  | 1.0149          |
-| 0.1988        | 46.0  | 920  | 1.0170          |
-| 0.1941        | 47.0  | 940  | 1.0204          |
-| 0.2083        | 48.0  | 960  | 1.0206          |
-| 0.2007        | 49.0  | 980  | 1.0208          |
-| 0.1931        | 50.0  | 1000 | 1.0208          |
 ### Framework versions

 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8258
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 8.2472        | 1.0   | 20   | 3.2102          |
+| 2.8238        | 2.0   | 40   | 1.2139          |
+| 1.7661        | 3.0   | 60   | 1.1075          |
+| 1.4094        | 4.0   | 80   | 1.0537          |
+| 1.2869        | 5.0   | 100  | 1.0106          |
+| 1.2366        | 6.0   | 120  | 0.9804          |
+| 1.1731        | 7.0   | 140  | 0.9549          |
+| 1.1356        | 8.0   | 160  | 0.9422          |
+| 1.1196        | 9.0   | 180  | 0.9286          |
+| 1.031         | 10.0  | 200  | 0.9169          |
+| 1.0438        | 11.0  | 220  | 0.9014          |
+| 1.0231        | 12.0  | 240  | 0.9007          |
+| 1.0015        | 13.0  | 260  | 0.8829          |
+| 0.9908        | 14.0  | 280  | 0.8803          |
+| 0.995         | 15.0  | 300  | 0.8689          |
+| 0.951         | 16.0  | 320  | 0.8638          |
+| 0.948         | 17.0  | 340  | 0.8601          |
+| 0.9157        | 18.0  | 360  | 0.8551          |
+| 0.9074        | 19.0  | 380  | 0.8519          |
+| 0.9021        | 20.0  | 400  | 0.8506          |
+| 0.8898        | 21.0  | 420  | 0.8472          |
+| 0.8842        | 22.0  | 440  | 0.8448          |
+| 0.9024        | 23.0  | 460  | 0.8437          |
+| 0.858         | 24.0  | 480  | 0.8403          |
+| 0.8801        | 25.0  | 500  | 0.8381          |
+| 0.8441        | 26.0  | 520  | 0.8375          |
+| 0.8379        | 27.0  | 540  | 0.8358          |
+| 0.8403        | 28.0  | 560  | 0.8344          |
+| 0.8615        | 29.0  | 580  | 0.8333          |
+| 0.8697        | 30.0  | 600  | 0.8327          |
+| 0.8403        | 31.0  | 620  | 0.8314          |
+| 0.8373        | 32.0  | 640  | 0.8299          |
+| 0.8094        | 33.0  | 660  | 0.8292          |
+| 0.8023        | 34.0  | 680  | 0.8291          |
+| 0.8426        | 35.0  | 700  | 0.8289          |
+| 0.8275        | 36.0  | 720  | 0.8281          |
+| 0.8177        | 37.0  | 740  | 0.8278          |
+| 0.8183        | 38.0  | 760  | 0.8266          |
+| 0.8058        | 39.0  | 780  | 0.8262          |
+| 0.7929        | 40.0  | 800  | 0.8263          |
+| 0.8218        | 41.0  | 820  | 0.8261          |
+| 0.8198        | 42.0  | 840  | 0.8261          |
+| 0.7957        | 43.0  | 860  | 0.8259          |
+| 0.7966        | 44.0  | 880  | 0.8260          |
+| 0.7941        | 45.0  | 900  | 0.8260          |
+| 0.7771        | 46.0  | 920  | 0.8261          |
+| 0.7883        | 47.0  | 940  | 0.8260          |
+| 0.8113        | 48.0  | 960  | 0.8259          |
+| 0.8155        | 49.0  | 980  | 0.8258          |
+| 0.7782        | 50.0  | 1000 | 0.8258          |
 ### Framework versions