End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7249
-- Accuracy: 0.8669
-- F1: 0.8749
-- Auc Roc: 0.9317
-- Log Loss: 0.7249
 ## Model description
@@ -50,15 +50,19 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Auc Roc | Log Loss |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:-------:|:--------:|
-| 1.1736        | 1.0   | 1618 | 0.6146          | 0.8507   | 0.8592 | 0.9256  | 0.6146   |
-| 0.6452        | 2.0   | 3236 | 0.7249          | 0.8669   | 0.8749 | 0.9317  | 0.7249   |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9470
+- Accuracy: 0.8644
+- F1: 0.8728
+- Auc Roc: 0.9185
+- Log Loss: 0.9470
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 6
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Auc Roc | Log Loss |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:-------:|:--------:|
+| 1.0534        | 1.0   | 1618 | 0.6479          | 0.8694   | 0.8696 | 0.9298  | 0.6479   |
+| 0.6971        | 2.0   | 3236 | 1.0859          | 0.8371   | 0.8581 | 0.9236  | 1.0859   |
+| 0.5832        | 3.0   | 4854 | 0.9261          | 0.8495   | 0.8672 | 0.9255  | 0.9261   |
+| 0.4402        | 4.0   | 6472 | 0.8507          | 0.8719   | 0.8804 | 0.9251  | 0.8507   |
+| 0.3475        | 5.0   | 8090 | 0.9284          | 0.8657   | 0.8735 | 0.9198  | 0.9283   |
+| 0.2985        | 6.0   | 9708 | 0.9470          | 0.8644   | 0.8728 | 0.9185  | 0.9470   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:02f0b5d0a85a3b873b4a183a19c22b0acf256a4c08e0f9b1355b1c34211241e5
 size 497780432

 version https://git-lfs.github.com/spec/v1
+oid sha256:bba396cdd3954fa6449e56fd26d07b8e71a29415aa7f0202a2844805faf46396
 size 497780432

runs/Jan24_20-13-09_7d2b1e2d4e1a/events.out.tfevents.1706127190.7d2b1e2d4e1a.26662.10 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:98f8c469a0ac1b736fdfd722766b3a5e4730843bb57eb3a0558723ec437aebca
+size 8670

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3de59719afd225fb332ed6d1b42b6d228a99601e6a3b5db147d421b6288ddab6
 size 4664

 version https://git-lfs.github.com/spec/v1
+oid sha256:4687dd4a4162f59df791f1cccd877165e6c3d736f89908c5329d32a5eaea2f11
 size 4664