Mathildeholst commited on
Commit
4ea4a18
·
verified ·
1 Parent(s): 39051d5

End of training

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [HuggingFaceTB/SmolLM2-135M](https://huggingface.co/HuggingFaceTB/SmolLM2-135M) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 6.6796
20
 
21
  ## Model description
22
 
@@ -47,17 +47,17 @@ The following hyperparameters were used during training:
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
- | 7.629 | 0.32 | 200 | 6.6328 |
51
- | 6.3124 | 0.64 | 400 | 6.6325 |
52
- | 6.3561 | 0.96 | 600 | 6.5846 |
53
- | 6.338 | 1.28 | 800 | 6.7885 |
54
- | 6.3065 | 1.6 | 1000 | 6.7319 |
55
- | 6.3446 | 1.92 | 1200 | 6.6796 |
56
 
57
 
58
  ### Framework versions
59
 
60
- - Transformers 4.56.1
61
  - Pytorch 2.8.0+cu126
62
  - Datasets 4.0.0
63
- - Tokenizers 0.22.0
 
16
 
17
  This model is a fine-tuned version of [HuggingFaceTB/SmolLM2-135M](https://huggingface.co/HuggingFaceTB/SmolLM2-135M) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 6.8104
20
 
21
  ## Model description
22
 
 
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
+ | 7.6463 | 0.32 | 200 | 6.4905 |
51
+ | 6.2428 | 0.64 | 400 | 6.3232 |
52
+ | 6.3949 | 0.96 | 600 | 6.5444 |
53
+ | 6.3416 | 1.28 | 800 | 6.8307 |
54
+ | 6.5742 | 1.6 | 1000 | 6.9138 |
55
+ | 6.4862 | 1.92 | 1200 | 6.8104 |
56
 
57
 
58
  ### Framework versions
59
 
60
+ - Transformers 4.57.1
61
  - Pytorch 2.8.0+cu126
62
  - Datasets 4.0.0
63
+ - Tokenizers 0.22.1