MariaZafar commited on
Commit
6220427
·
1 Parent(s): ce1db55

End of training

Browse files
Files changed (2) hide show
  1. README.md +52 -3
  2. tf_model.h5 +1 -1
README.md CHANGED
@@ -14,9 +14,9 @@ probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Train Loss: 5.8858
18
- - Validation Loss: 7.5655
19
- - Epoch: 0
20
 
21
  ## Model description
22
 
@@ -43,6 +43,55 @@ The following hyperparameters were used during training:
43
  | Train Loss | Validation Loss | Epoch |
44
  |:----------:|:---------------:|:-----:|
45
  | 5.8858 | 7.5655 | 0 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
46
 
47
 
48
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Train Loss: 0.7785
18
+ - Validation Loss: 3.7004
19
+ - Epoch: 49
20
 
21
  ## Model description
22
 
 
43
  | Train Loss | Validation Loss | Epoch |
44
  |:----------:|:---------------:|:-----:|
45
  | 5.8858 | 7.5655 | 0 |
46
+ | 4.0619 | 5.8193 | 1 |
47
+ | 3.3766 | 4.9585 | 2 |
48
+ | 3.0686 | 4.5764 | 3 |
49
+ | 2.9022 | 4.3847 | 4 |
50
+ | 2.7838 | 4.2249 | 5 |
51
+ | 2.6997 | 4.1060 | 6 |
52
+ | 2.6154 | 4.0100 | 7 |
53
+ | 2.5575 | 3.9412 | 8 |
54
+ | 2.4933 | 3.8447 | 9 |
55
+ | 2.4397 | 3.7619 | 10 |
56
+ | 2.3835 | 3.7510 | 11 |
57
+ | 2.3403 | 3.6810 | 12 |
58
+ | 2.2924 | 3.6716 | 13 |
59
+ | 2.2513 | 3.6335 | 14 |
60
+ | 2.2031 | 3.6208 | 15 |
61
+ | 2.1619 | 3.5915 | 16 |
62
+ | 2.1234 | 3.5497 | 17 |
63
+ | 2.0792 | 3.5540 | 18 |
64
+ | 2.0398 | 3.5461 | 19 |
65
+ | 1.9976 | 3.5282 | 20 |
66
+ | 1.9577 | 3.5260 | 21 |
67
+ | 1.9176 | 3.5041 | 22 |
68
+ | 1.8745 | 3.4994 | 23 |
69
+ | 1.8304 | 3.5250 | 24 |
70
+ | 1.7881 | 3.4864 | 25 |
71
+ | 1.7423 | 3.4718 | 26 |
72
+ | 1.6993 | 3.5194 | 27 |
73
+ | 1.6503 | 3.5019 | 28 |
74
+ | 1.6025 | 3.5055 | 29 |
75
+ | 1.5500 | 3.5109 | 30 |
76
+ | 1.4964 | 3.5389 | 31 |
77
+ | 1.4448 | 3.5393 | 32 |
78
+ | 1.3954 | 3.5363 | 33 |
79
+ | 1.3464 | 3.5446 | 34 |
80
+ | 1.2978 | 3.5117 | 35 |
81
+ | 1.2494 | 3.5225 | 36 |
82
+ | 1.2004 | 3.5443 | 37 |
83
+ | 1.1534 | 3.5909 | 38 |
84
+ | 1.1124 | 3.5380 | 39 |
85
+ | 1.0709 | 3.6162 | 40 |
86
+ | 1.0265 | 3.6758 | 41 |
87
+ | 0.9936 | 3.6168 | 42 |
88
+ | 0.9590 | 3.6243 | 43 |
89
+ | 0.9238 | 3.6308 | 44 |
90
+ | 0.8886 | 3.6429 | 45 |
91
+ | 0.8635 | 3.7137 | 46 |
92
+ | 0.8352 | 3.6512 | 47 |
93
+ | 0.8050 | 3.7033 | 48 |
94
+ | 0.7785 | 3.7004 | 49 |
95
 
96
 
97
  ### Framework versions
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:59ee92682584d08a3d633ad099b802282c2f5e0fd7369bcf01044fa7e172f1f8
3
  size 497935464
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9944d1f7dba7fe33a9355a1250346b48d5815a40f1d42c208dffb390a4170e45
3
  size 497935464