End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9977
-- F1: 71.1538
-- Gen Len: 2.0
 ## Model description
@@ -38,8 +38,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear

 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3669
+- F1: 59.791
+- Gen Len: 2.0469
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear

logs/events.out.tfevents.1739726211.f42d15cde534.31.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:da26e3f3065d55d17bc5ae24cae7779591d2af88935ade6cbad8f32178fcde09
-size 6646

 version https://git-lfs.github.com/spec/v1
+oid sha256:ed1a9f7b064a6a93aa15192e1168138d76f17369f64320e10ad9568cfccfc783
+size 7000

logs/events.out.tfevents.1739727027.f42d15cde534.31.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:10a55d7d9e3fcfa51eec5c069514ecc6a9de5ecfd37c3f576e72f6875247c711
+size 456

tokenizer.json CHANGED Viewed

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 2,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 2
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 3,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 3
     },
     "direction": "Right",
     "pad_to_multiple_of": null,