ninagroot/GPT2-705Mtest

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.7957
 ## Model description
@@ -41,22 +41,31 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 50
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 0.86  | 3    | 9.1712          |
-| No log        | 2.0   | 7    | 8.7604          |
-| No log        | 2.86  | 10   | 7.5925          |
-| No log        | 4.0   | 14   | 7.3624          |
-| No log        | 4.86  | 17   | 6.9348          |
-| 7.3776        | 6.0   | 21   | 6.4774          |
-| 7.3776        | 6.86  | 24   | 6.4065          |
-| 7.3776        | 8.0   | 28   | 5.9677          |
-| 7.3776        | 8.57  | 30   | 5.7957          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.5515
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 50
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.86  | 3    | 8.6420          |
+| No log        | 2.0   | 7    | 8.3017          |
+| No log        | 2.86  | 10   | 7.4703          |
+| No log        | 4.0   | 14   | 7.3033          |
+| No log        | 4.86  | 17   | 6.6681          |
+| 7.0997        | 6.0   | 21   | 6.2953          |
+| 7.0997        | 6.86  | 24   | 5.9852          |
+| 7.0997        | 8.0   | 28   | 5.7430          |
+| 7.0997        | 8.86  | 31   | 5.5474          |
+| 7.0997        | 10.0  | 35   | 5.5484          |
+| 7.0997        | 10.86 | 38   | 5.4335          |
+| 4.4904        | 12.0  | 42   | 5.4799          |
+| 4.4904        | 12.86 | 45   | 5.4311          |
+| 4.4904        | 14.0  | 49   | 5.7129          |
+| 4.4904        | 14.86 | 52   | 5.5459          |
+| 4.4904        | 16.0  | 56   | 5.5459          |
+| 4.4904        | 16.86 | 59   | 5.5522          |
+| 3.0663        | 17.14 | 60   | 5.5515          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1a97e4f33ec53e9ee3587a4c8d037c3739a5750718be3288f0206a3d76e6baa7
 size 2796386080

 version https://git-lfs.github.com/spec/v1
+oid sha256:ee4a9c5cdd3799705c0296fcf0aec464ca69228d3b8120854fc6e5eb7f36ae30
 size 2796386080

runs/Apr16_11-04-43_gcn13.local.snellius.surf.nl/events.out.tfevents.1713258292.gcn13.local.snellius.surf.nl.484961.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:cc25ab478c73a0a963ff044654a8079b34dce1f7cfb19feaed4eb7352c02de03
+size 10435

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9737ed2ac8b321c46a74a7040cd59b1107ef509adb81edf26aaff17959f3223c
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:7d76a6045c2ee2ccfa0812342fad889ec172e533d684e35763532afaa5c0e77d
 size 4984