ninagroot/GPT2-705Mtest

Files changed (4)

README.md CHANGED

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 8.4040
+- Loss: 7.2081
 ## Model description
@@ -41,17 +41,22 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 50
-- num_epochs: 7
+- num_epochs: 15
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 9.6206        | 0.57  | 1    | 9.6212          |
-| 8.0141        | 1.71  | 3    | 9.3085          |
-| 7.7709        | 2.86  | 5    | 7.8544          |
-| 8.4543        | 4.0   | 7    | 8.4040          |
+| 9.6439        | 0.57  | 1    | 9.6409          |
+| 8.0775        | 1.71  | 3    | 9.6248          |
+| 8.2418        | 2.86  | 5    | 8.7251          |
+| 7.6995        | 4.0   | 7    | 8.0438          |
+| 7.4715        | 4.57  | 8    | 8.1243          |
+| 7.7918        | 5.71  | 10   | 7.7544          |
+| 7.1086        | 6.86  | 12   | 7.4231          |
+| 6.7919        | 8.0   | 14   | 7.1621          |
+| 6.4626        | 8.57  | 15   | 7.2081          |
 ### Framework versions
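
Reviewer note: the validation losses in the updated table can be read as perplexities, assuming they are mean per-token cross-entropy values in nats. That is the usual convention for a causal language model trained with the Hugging Face Trainer, but the diff itself does not state it. The sketch below copies the nine (step, validation loss) pairs from the added rows. The names `VALIDATION_LOSS` and `perplexity` are illustrative and not part of the repository.

```python
import math

# (step, validation_loss) pairs copied from the added rows of the README table.
VALIDATION_LOSS = [
    (1, 9.6409),
    (3, 9.6248),
    (5, 8.7251),
    (7, 8.0438),
    (8, 8.1243),
    (10, 7.7544),
    (12, 7.4231),
    (14, 7.1621),
    (15, 7.2081),
]


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


# Lowest validation loss seen during the run, and the last logged row.
best_step, best_loss = min(VALIDATION_LOSS, key=lambda row: row[1])
final_step, final_loss = VALIDATION_LOSS[-1]

if __name__ == "__main__":
    print(f"best:  step {best_step}, loss {best_loss}, ppl {perplexity(best_loss):.1f}")
    print(f"final: step {final_step}, loss {final_loss}, ppl {perplexity(final_loss):.1f}")
```

On these numbers the lowest validation loss is at step 14 (7.1621), while the headline `Loss: 7.2081` matches the last logged row at step 15.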

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ae326a3dc782a1b33f60121c45c61ba8fa074d8a378a01954a6e1d53ff55672a
+oid sha256:2f1f9203db249777c29b82b484e27c1fc2e664be0559a016f326bdee7f55265f
 size 2796386080
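
Reviewer note: the file tracked here is a Git LFS pointer, so only the `oid` changes while `size` stays at 2796386080 bytes. A downloaded `model.safetensors` can be checked against the new pointer by comparing its byte size and SHA-256 digest. The following is a minimal sketch of that check. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path is whatever the reader downloaded to.

```python
import hashlib
from pathlib import Path

# New-side pointer for model.safetensors, copied from this diff.
NEW_MODEL_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:2f1f9203db249777c29b82b484e27c1fc2e664be0559a016f326bdee7f55265f
size 2796386080
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first; avoids hashing a multi-GB file
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in chunks so the whole file is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

A call such as `matches_pointer("model.safetensors", parse_lfs_pointer(NEW_MODEL_POINTER))` would then report whether a local copy corresponds to this commit rather than the previous one.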

runs/Apr16_14-32-37_gcn33.local.snellius.surf.nl/events.out.tfevents.1713270766.gcn33.local.snellius.surf.nl.184936.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:d54eed2efdc1a4c909bf0b368fe34cf7950dc21f6a0eb01564e2796e9f70a3f6
+size 10525

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e18c8cd30fb93d9c4f2a922f1bb0b1c9bdec0d0f9e9d607e3614a92357eb1850
+oid sha256:6e54f97170740d4086e8930b99a3fa778220e4cb0c9979b0c991f4a96c4a2129
 size 4984