JstnMcBrd committed on
Commit a8134cb · verified · 1 Parent(s): 8edd0bb

End of training

Files changed (1)
  1. README.md +14 -1
README.md CHANGED
@@ -15,6 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
  # SmolLM2-135M-finetuned-python-purpose

  This model is a fine-tuned version of [HuggingFaceTB/SmolLM2-135M](https://huggingface.co/HuggingFaceTB/SmolLM2-135M) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.3285

  ## Model description

@@ -39,7 +41,18 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 4
+ - num_epochs: 5
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:-----:|:---------------:|
+ | 0.7266 | 1.0 | 2745 | 0.6829 |
+ | 0.4378 | 2.0 | 5490 | 0.5090 |
+ | 0.2008 | 3.0 | 8235 | 0.3869 |
+ | 0.071 | 4.0 | 10980 | 0.3293 |
+ | 0.0145 | 5.0 | 13725 | 0.3285 |
+

  ### Framework versions

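The numbers in the "Training results" table added by this commit can be checked with a few lines of Python. The sketch below is not part of the commit: the names `RESULTS` and `summarize` are hypothetical, and the perplexity column assumes the reported loss is a mean cross-entropy in nats, which the README does not state. It computes the gap between validation and training loss per epoch and how much the validation loss improved over the previous epoch.

```python
import math

# (epoch, step, training_loss, validation_loss), copied from the
# "Training results" table in the diff. Nothing here is re-measured.
RESULTS = [
    (1, 2745, 0.7266, 0.6829),
    (2, 5490, 0.4378, 0.5090),
    (3, 8235, 0.2008, 0.3869),
    (4, 10980, 0.0710, 0.3293),
    (5, 13725, 0.0145, 0.3285),
]


def summarize(results):
    """Return one summary dict per epoch (hypothetical helper)."""
    rows = []
    prev_val = None
    for epoch, step, train_loss, val_loss in results:
        rows.append({
            "epoch": epoch,
            # Steps per epoch implied by the cumulative step count.
            "steps_per_epoch": step / epoch,
            # Positive gap: validation loss is higher than training loss.
            "gap": val_loss - train_loss,
            # Assumption: loss is mean cross-entropy in nats.
            "val_perplexity": math.exp(val_loss),
            # Drop in validation loss relative to the previous epoch.
            "val_improvement": None if prev_val is None else prev_val - val_loss,
        })
        prev_val = val_loss
    return rows


if __name__ == "__main__":
    for row in summarize(RESULTS):
        print(row)
```

Read this way, the fifth epoch (the one this commit adds by raising `num_epochs` from 4 to 5) lowers the validation loss by only about 0.0008, while the training loss keeps falling and the gap between the two widens to roughly 0.31.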