End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.1667
 ## Model description
@@ -41,15 +41,22 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3.0
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 140  | 3.2615          |
-| No log        | 2.0   | 280  | 3.1849          |
-| No log        | 3.0   | 420  | 3.1667          |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.9980
 ## Model description
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 140  | 3.2420          |
+| No log        | 2.0   | 280  | 3.1426          |
+| No log        | 3.0   | 420  | 3.0921          |
+| 3.2444        | 4.0   | 560  | 3.0593          |
+| 3.2444        | 5.0   | 700  | 3.0417          |
+| 3.2444        | 6.0   | 840  | 3.0215          |
+| 3.2444        | 7.0   | 980  | 3.0117          |
+| 2.9628        | 8.0   | 1120 | 3.0048          |
+| 2.9628        | 9.0   | 1260 | 3.0008          |
+| 2.9628        | 10.0  | 1400 | 2.9980          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:92b661b3dc94d492ca906ad22c65a2e8e619f6c963548306c92e82b9902b51ac
 size 327657928

 version https://git-lfs.github.com/spec/v1
+oid sha256:32b3ec8f54edd8e78bb84fa33faf365514d89ae12028972e9e356d0882955bfb
 size 327657928

runs/Dec10_22-17-17_ltrcgpu2/events.out.tfevents.1733849237.ltrcgpu2.3487582.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:78d52510d9e1d9c8dacde6a942668f2598e3137f2f23313c7aea4ddc8801fe3a
-size 7719

 version https://git-lfs.github.com/spec/v1
+oid sha256:27babb2a019a9e3ef5cfc82623e7118c6893d9f5217393ad168c6093acf400eb
+size 8886

runs/Dec10_22-17-17_ltrcgpu2/events.out.tfevents.1733849465.ltrcgpu2.3487582.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:a4047a57e595858d07fecc6695d0246d9439728b43ed5b2c6fb6aca7c99eb4aa
+size 359