WilliamHH commited on
Commit
e9f01e2
·
verified ·
1 Parent(s): e985c77

End of training

Browse files
Files changed (1) hide show
  1. README.md +13 -9
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [HuggingFaceTB/SmolLM-135M](https://huggingface.co/HuggingFaceTB/SmolLM-135M) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 4.3715
20
 
21
  ## Model description
22
 
@@ -41,19 +41,23 @@ The following hyperparameters were used during training:
41
  - seed: 42
42
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
43
  - lr_scheduler_type: constant
44
- - num_epochs: 2
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:------:|:----:|:---------------:|
50
- | 0.9187 | 0.2667 | 200 | 3.7701 |
51
- | 0.9322 | 0.5333 | 400 | 3.8297 |
52
- | 0.9511 | 0.8 | 600 | 3.8804 |
53
- | 0.8337 | 1.0667 | 800 | 4.2631 |
54
- | 0.4772 | 1.3333 | 1000 | 4.3414 |
55
- | 0.5198 | 1.6 | 1200 | 4.3616 |
56
- | 0.5438 | 1.8667 | 1400 | 4.3715 |
 
 
 
 
57
 
58
 
59
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [HuggingFaceTB/SmolLM-135M](https://huggingface.co/HuggingFaceTB/SmolLM-135M) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 4.6061
20
 
21
  ## Model description
22
 
 
41
  - seed: 42
42
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
43
  - lr_scheduler_type: constant
44
+ - num_epochs: 3
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:------:|:----:|:---------------:|
50
+ | 0.2744 | 0.2667 | 200 | 3.9036 |
51
+ | 0.3205 | 0.5333 | 400 | 3.9461 |
52
+ | 0.3397 | 0.8 | 600 | 4.1281 |
53
+ | 0.328 | 1.0667 | 800 | 4.3542 |
54
+ | 0.229 | 1.3333 | 1000 | 4.3238 |
55
+ | 0.2537 | 1.6 | 1200 | 4.4181 |
56
+ | 0.2656 | 1.8667 | 1400 | 4.4773 |
57
+ | 0.2381 | 2.1333 | 1600 | 4.4933 |
58
+ | 0.2086 | 2.4 | 1800 | 4.4857 |
59
+ | 0.2258 | 2.6667 | 2000 | 4.5592 |
60
+ | 0.2313 | 2.9333 | 2200 | 4.6061 |
61
 
62
 
63
  ### Framework versions