Commit ·
6220427
1
Parent(s): ce1db55
End of training
Browse files- README.md +52 -3
- tf_model.h5 +1 -1
README.md
CHANGED
|
@@ -14,9 +14,9 @@ probably proofread and complete it, then remove this comment. -->
|
|
| 14 |
|
| 15 |
This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
|
| 16 |
It achieves the following results on the evaluation set:
|
| 17 |
-
- Train Loss:
|
| 18 |
-
- Validation Loss:
|
| 19 |
-
- Epoch:
|
| 20 |
|
| 21 |
## Model description
|
| 22 |
|
|
@@ -43,6 +43,55 @@ The following hyperparameters were used during training:
|
|
| 43 |
| Train Loss | Validation Loss | Epoch |
|
| 44 |
|:----------:|:---------------:|:-----:|
|
| 45 |
| 5.8858 | 7.5655 | 0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 46 |
|
| 47 |
|
| 48 |
### Framework versions
|
|
|
|
| 14 |
|
| 15 |
This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
|
| 16 |
It achieves the following results on the evaluation set:
|
| 17 |
+
- Train Loss: 0.7785
|
| 18 |
+
- Validation Loss: 3.7004
|
| 19 |
+
- Epoch: 49
|
| 20 |
|
| 21 |
## Model description
|
| 22 |
|
|
|
|
| 43 |
| Train Loss | Validation Loss | Epoch |
|
| 44 |
|:----------:|:---------------:|:-----:|
|
| 45 |
| 5.8858 | 7.5655 | 0 |
|
| 46 |
+
| 4.0619 | 5.8193 | 1 |
|
| 47 |
+
| 3.3766 | 4.9585 | 2 |
|
| 48 |
+
| 3.0686 | 4.5764 | 3 |
|
| 49 |
+
| 2.9022 | 4.3847 | 4 |
|
| 50 |
+
| 2.7838 | 4.2249 | 5 |
|
| 51 |
+
| 2.6997 | 4.1060 | 6 |
|
| 52 |
+
| 2.6154 | 4.0100 | 7 |
|
| 53 |
+
| 2.5575 | 3.9412 | 8 |
|
| 54 |
+
| 2.4933 | 3.8447 | 9 |
|
| 55 |
+
| 2.4397 | 3.7619 | 10 |
|
| 56 |
+
| 2.3835 | 3.7510 | 11 |
|
| 57 |
+
| 2.3403 | 3.6810 | 12 |
|
| 58 |
+
| 2.2924 | 3.6716 | 13 |
|
| 59 |
+
| 2.2513 | 3.6335 | 14 |
|
| 60 |
+
| 2.2031 | 3.6208 | 15 |
|
| 61 |
+
| 2.1619 | 3.5915 | 16 |
|
| 62 |
+
| 2.1234 | 3.5497 | 17 |
|
| 63 |
+
| 2.0792 | 3.5540 | 18 |
|
| 64 |
+
| 2.0398 | 3.5461 | 19 |
|
| 65 |
+
| 1.9976 | 3.5282 | 20 |
|
| 66 |
+
| 1.9577 | 3.5260 | 21 |
|
| 67 |
+
| 1.9176 | 3.5041 | 22 |
|
| 68 |
+
| 1.8745 | 3.4994 | 23 |
|
| 69 |
+
| 1.8304 | 3.5250 | 24 |
|
| 70 |
+
| 1.7881 | 3.4864 | 25 |
|
| 71 |
+
| 1.7423 | 3.4718 | 26 |
|
| 72 |
+
| 1.6993 | 3.5194 | 27 |
|
| 73 |
+
| 1.6503 | 3.5019 | 28 |
|
| 74 |
+
| 1.6025 | 3.5055 | 29 |
|
| 75 |
+
| 1.5500 | 3.5109 | 30 |
|
| 76 |
+
| 1.4964 | 3.5389 | 31 |
|
| 77 |
+
| 1.4448 | 3.5393 | 32 |
|
| 78 |
+
| 1.3954 | 3.5363 | 33 |
|
| 79 |
+
| 1.3464 | 3.5446 | 34 |
|
| 80 |
+
| 1.2978 | 3.5117 | 35 |
|
| 81 |
+
| 1.2494 | 3.5225 | 36 |
|
| 82 |
+
| 1.2004 | 3.5443 | 37 |
|
| 83 |
+
| 1.1534 | 3.5909 | 38 |
|
| 84 |
+
| 1.1124 | 3.5380 | 39 |
|
| 85 |
+
| 1.0709 | 3.6162 | 40 |
|
| 86 |
+
| 1.0265 | 3.6758 | 41 |
|
| 87 |
+
| 0.9936 | 3.6168 | 42 |
|
| 88 |
+
| 0.9590 | 3.6243 | 43 |
|
| 89 |
+
| 0.9238 | 3.6308 | 44 |
|
| 90 |
+
| 0.8886 | 3.6429 | 45 |
|
| 91 |
+
| 0.8635 | 3.7137 | 46 |
|
| 92 |
+
| 0.8352 | 3.6512 | 47 |
|
| 93 |
+
| 0.8050 | 3.7033 | 48 |
|
| 94 |
+
| 0.7785 | 3.7004 | 49 |
|
| 95 |
|
| 96 |
|
| 97 |
### Framework versions
|
tf_model.h5
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 497935464
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9944d1f7dba7fe33a9355a1250346b48d5815a40f1d42c208dffb390a4170e45
|
| 3 |
size 497935464
|