Model save
README.md
CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss:
+- Loss: 3.1463
 
 ## Model description
 
@@ -44,32 +44,12 @@ The following hyperparameters were used during training:
 
 ### Training results
 
-| Training Loss | Epoch
-|:-------------:|:-----
-|
-|
-|
-|
-| 2.0751 | 2.0492 | 500 | 2.3450 |
-| 1.8368 | 2.4590 | 600 | 2.3120 |
-| 1.9313 | 2.8689 | 700 | 2.2539 |
-| 1.7337 | 3.2787 | 800 | 2.2305 |
-| 1.6605 | 3.6885 | 900 | 2.2210 |
-| 1.5663 | 4.0984 | 1000 | 2.2515 |
-| 1.5205 | 4.5082 | 1100 | 2.2320 |
-| 1.483 | 4.9180 | 1200 | 2.2033 |
-| 1.3176 | 5.3279 | 1300 | 2.2459 |
-| 1.355 | 5.7377 | 1400 | 2.2160 |
-| 1.3254 | 6.1475 | 1500 | 2.2739 |
-| 1.2533 | 6.5574 | 1600 | 2.2401 |
-| 1.1446 | 6.9672 | 1700 | 2.2370 |
-| 1.0944 | 7.3770 | 1800 | 2.2993 |
-| 1.0947 | 7.7869 | 1900 | 2.2739 |
-| 1.1601 | 8.1967 | 2000 | 2.3085 |
-| 1.094 | 8.6066 | 2100 | 2.2998 |
-| 0.9896 | 9.0164 | 2200 | 2.3065 |
-| 1.0642 | 9.4262 | 2300 | 2.3114 |
-| 0.9649 | 9.8361 | 2400 | 2.3188 |
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.9367 | 2.5 | 100 | 3.0852 |
+| 1.9914 | 5.0 | 200 | 2.9862 |
+| 1.5067 | 7.5 | 300 | 3.1059 |
+| 1.2716 | 10.0 | 400 | 3.1463 |
 
 
 ### Framework versions
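The new Training results table in the README.md diff above shows validation loss bottoming out at step 200 and rising afterwards, while the reported evaluation loss (3.1463) matches the final step. The sketch below makes that comparison explicit using only the four rows from the diff. The helper names `best_row` and `perplexity` are illustrative, and the perplexity conversion assumes the loss is a mean per-token cross-entropy in nats, which the diff itself does not state.

```python
import math

# Rows copied from the new "Training results" table in the diff:
# (training_loss, epoch, step, validation_loss)
ROWS = [
    (2.9367, 2.5, 100, 3.0852),
    (1.9914, 5.0, 200, 2.9862),
    (1.5067, 7.5, 300, 3.1059),
    (1.2716, 10.0, 400, 3.1463),
]


def best_row(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[3])


def perplexity(loss):
    """Convert a loss to perplexity.

    Assumption: `loss` is a mean per-token cross-entropy measured in nats.
    """
    return math.exp(loss)


if __name__ == "__main__":
    _, epoch, step, val_loss = best_row(ROWS)
    final_val_loss = ROWS[-1][3]
    print(f"lowest validation loss: {val_loss} at step {step} (epoch {epoch})")
    print(f"final validation loss:  {final_val_loss}")
    print(f"final perplexity (under the stated assumption): {perplexity(final_val_loss):.2f}")
```

Training loss keeps falling across all four rows while validation loss rises after step 200, so the final checkpoint is not necessarily the one with the best validation loss.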
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:acc91276c9640f587c30dc491a144ff0e841ff77fe87cdadba98565823c128a8
 size 497774208
runs/May13_06-55-56_8b3264d74ee3/events.out.tfevents.1715583358.8b3264d74ee3.3000.0
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:97fac5a566c7ba544b145d13706b31dff8ae77ba0029856a08a09e77c3d4ea9f
+size 7183
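The diffs for model.safetensors and the TensorBoard events file change only Git LFS pointer text (a `version`, `oid`, and `size` line), not the binary contents. As a minimal sketch of how such a pointer can be read, the hypothetical helper `parse_lfs_pointer` below splits each line into a key and a value. It assumes the three-line layout shown in this commit and is not a full implementation of the Git LFS specification.

```python
def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into a dict (hypothetical helper).

    Assumes each non-empty line has the form "<key> <value>", as in the
    pointers shown in this commit.
    """
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value

    # The oid line has the form "<hash algorithm>:<hex digest>".
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# New pointer for the TensorBoard events file, copied from the diff.
EVENTS_POINTER = (
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:97fac5a566c7ba544b145d13706b31dff8ae77ba0029856a08a09e77c3d4ea9f\n"
    "size 7183\n"
)

if __name__ == "__main__":
    pointer = parse_lfs_pointer(EVENTS_POINTER)
    print(pointer["hash_algo"], pointer["digest"], pointer["size"])
```

In the model.safetensors pointer the `size` line is unchanged context (497774208 bytes) and only the `oid` line differs between the two sides of the diff.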