ninagroot/GPT2-705Mtest

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 7.2750
 ## Model description
@@ -41,17 +41,23 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 300
-- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 0.91  | 2    | 7.9754          |
-| No log        | 1.83  | 4    | 8.3820          |
-| No log        | 2.74  | 6    | 7.3030          |
-| No log        | 3.66  | 8    | 7.2750          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 6.0408
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 300
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.91  | 2    | 8.3080          |
+| No log        | 1.83  | 4    | 8.4099          |
+| No log        | 2.74  | 6    | 7.3996          |
+| No log        | 3.66  | 8    | 7.2565          |
+| No log        | 4.57  | 10   | 6.9514          |
+| No log        | 5.94  | 13   | 6.5453          |
+| No log        | 6.86  | 15   | 6.3409          |
+| No log        | 7.77  | 17   | 6.1577          |
+| No log        | 8.69  | 19   | 6.5267          |
+| 7.123         | 9.14  | 20   | 6.0408          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4a31042b3f2f75ff763b29c7efc5775194dfa6daaf6e8667503f739e55ed0d66
 size 2747934496

 version https://git-lfs.github.com/spec/v1
+oid sha256:bd662d2dccbe4570d5feeec8fb6f2f2af6de8a84d02c8c69e363fae122e7a31e
 size 2747934496

runs/Mar20_14-27-44_gcn33.local.snellius.surf.nl/events.out.tfevents.1710941273.gcn33.local.snellius.surf.nl.2362503.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:a3ff7a8c01c19bb19ddd3028e29e66149ae80374a47da5dda03e862eea8e39c7
+size 7598

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c2e0ba9cc652b9679de4450d828169451120f45fe0b3458e219a4f217af043a6
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:f1f1bf6fb4d7253fbad755965768a42ded4a1786e5b4ec84633c943289b78953
 size 4728