eliwill commited on
Commit
6dc9a4e
·
1 Parent(s): cad525a

End of training

Browse files
README.md CHANGED
@@ -14,9 +14,9 @@ probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Train Loss: 4.0230
18
- - Validation Loss: 3.9474
19
- - Epoch: 0
20
 
21
  ## Model description
22
 
@@ -43,6 +43,15 @@ The following hyperparameters were used during training:
43
  | Train Loss | Validation Loss | Epoch |
44
  |:----------:|:---------------:|:-----:|
45
  | 4.0230 | 3.9474 | 0 |
 
 
 
 
 
 
 
 
 
46
 
47
 
48
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Train Loss: 3.4753
18
+ - Validation Loss: 3.7980
19
+ - Epoch: 9
20
 
21
  ## Model description
22
 
 
43
  | Train Loss | Validation Loss | Epoch |
44
  |:----------:|:---------------:|:-----:|
45
  | 4.0230 | 3.9474 | 0 |
46
+ | 3.8580 | 3.8982 | 1 |
47
+ | 3.7757 | 3.8721 | 2 |
48
+ | 3.7149 | 3.8489 | 3 |
49
+ | 3.6640 | 3.8343 | 4 |
50
+ | 3.6210 | 3.8152 | 5 |
51
+ | 3.5796 | 3.8088 | 6 |
52
+ | 3.5429 | 3.8038 | 7 |
53
+ | 3.5061 | 3.7967 | 8 |
54
+ | 3.4753 | 3.7980 | 9 |
55
 
56
 
57
  ### Framework versions
logs/train/events.out.tfevents.1664130319.b847084d5e3f.67.0.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:949d722245d0c296dc18402ebc25f42b7202fe582c30ebb5b20e0328a25b79ec
3
- size 891164
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:102555006dde7e32fcab6a3daedd9e1b4953c68910bccca874f9ca05d4f965a2
3
+ size 891776
logs/validation/events.out.tfevents.1664130386.b847084d5e3f.67.1.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2274e4d882d22865be3d6a3ba6a5a8f40a249bc53d86b3cb9d25ed57c714b122
3
- size 194
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0e336d1775231c3f1446bd9aa2e9268b29760e39ad62300f60f457a5a10d12c8
3
+ size 1598
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:db92ec17abc583b958573e28ef3a53f2bd3485c944afef985b4b6bdb95e65606
3
  size 327745496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:10d4bfe09f2b302dd0acb09cc1cf3d746d45e5052ea6079c74033cbf6344c7ef
3
  size 327745496