Harshatheeswar commited on
Commit
b70fcbd
·
verified ·
1 Parent(s): 5b960c6

End of training

Browse files
README.md CHANGED
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # gpt2-scratch
15
 
16
- This model is a fine-tuned version of [Harshatheeswar/gpt2-scratch](https://huggingface.co/Harshatheeswar/gpt2-scratch) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 4.2516
19
 
20
  ## Model description
21
 
@@ -49,11 +49,12 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
- | 4.3012 | 1.0 | 1390 | 4.2516 |
53
 
54
 
55
  ### Framework versions
56
 
57
  - Transformers 4.44.2
58
  - Pytorch 2.4.1+cu121
 
59
  - Tokenizers 0.19.1
 
13
 
14
  # gpt2-scratch
15
 
16
+ This model is a fine-tuned version of [Harshatheeswar/gpt2-scratch](https://huggingface.co/Harshatheeswar/gpt2-scratch) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 4.1380
19
 
20
  ## Model description
21
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | 4.2094 | 1.0 | 1390 | 4.1380 |
53
 
54
 
55
  ### Framework versions
56
 
57
  - Transformers 4.44.2
58
  - Pytorch 2.4.1+cu121
59
+ - Datasets 3.0.1
60
  - Tokenizers 0.19.1
runs/Oct19_14-50-36_22a6a41c53da/events.out.tfevents.1729349446.22a6a41c53da.2967.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3cdc4c9dcf31c1d6f997b0e1a6a4d8b6f2ff28fa4c5cc2331444b6c42622ec8f
3
- size 34401
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d390651776b6cc08e01d05a856c8ed46e3488357de30712a0c7c0fa951cc5f04
3
+ size 35026