End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -15,7 +15,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/google/long-t5-tglobal-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4039
 ## Model description
@@ -40,22 +45,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 394  | 1.9051          |
-| 3.4843        | 2.0   | 788  | 1.4999          |
-| 1.8807        | 3.0   | 1182 | 1.4607          |
-| 1.7485        | 4.0   | 1576 | 1.4434          |
-| 1.7485        | 5.0   | 1970 | 1.4264          |
-| 1.6669        | 6.0   | 2364 | 1.4211          |
-| 1.6346        | 7.0   | 2758 | 1.4134          |
-| 1.6131        | 8.0   | 3152 | 1.4101          |
-| 1.6039        | 9.0   | 3546 | 1.4045          |
-| 1.6039        | 10.0  | 3940 | 1.4039          |
 ### Framework versions

 This model is a fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/google/long-t5-tglobal-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.5497
+- Rouge Rouge1: 0.3896
+- Rouge Rouge2: 0.1402
+- Rouge Rougel: 0.227
+- Rouge Rougelsum: 0.2269
+- Gen Len: 392.0152
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge Rouge1 | Rouge Rouge2 | Rouge Rougel | Rouge Rougelsum | Gen Len  |
+|:-------------:|:-----:|:----:|:---------------:|:------------:|:------------:|:------------:|:---------------:|:--------:|
+| No log        | 1.0   | 394  | 1.9062          | 0.0          | 0.0          | 0.0          | 0.0             | 0.0      |
+| 3.7289        | 2.0   | 788  | 1.5960          | 0.279        | 0.0991       | 0.1665       | 0.1662          | 320.4091 |
+| 2.0261        | 3.0   | 1182 | 1.5497          | 0.3896       | 0.1402       | 0.227        | 0.2269          | 392.0152 |
 ### Framework versions

generation_config.json CHANGED Viewed

@@ -1,5 +1,4 @@
 {
-  "_from_model_config": true,
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "max_length": 400,

 {
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "max_length": 400,

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:efe4d62d8ebc7335fcfbaac1eda551f987d2096042dd3ecb79449fa31ed0b980
 size 990386200

 version https://git-lfs.github.com/spec/v1
+oid sha256:443ac3fb0c9b55856cf261149ebf2e1678c32bc2f3f117f695424070ae9aa34d
 size 990386200

runs/Dec03_21-15-34_user/events.out.tfevents.1733240735.user.1394782.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a46d1f80b226f86f263cb60985bd29c4d2a6629cb2c7e4405822a1462fbf7081
-size 6163

 version https://git-lfs.github.com/spec/v1
+oid sha256:94f43d4bab0cc2558c9604215b8d33c21c7659da782052f74840383f742434d5
+size 7066