rbelanec commited on
Commit
4d51061
·
verified ·
1 Parent(s): f4193aa

Model save

Browse files
Files changed (2) hide show
  1. README.md +22 -22
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -17,10 +17,10 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  # test
19
 
20
- This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the wsc dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.3491
23
- - Num Input Tokens Seen: 43904
24
 
25
  ## Model description
26
 
@@ -52,25 +52,25 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Input Tokens Seen |
54
  |:-------------:|:------:|:----:|:---------------:|:-----------------:|
55
- | 0.7462 | 0.0522 | 13 | 0.6849 | 2288 |
56
- | 0.6639 | 0.1044 | 26 | 0.4557 | 4656 |
57
- | 0.3742 | 0.1566 | 39 | 0.3849 | 6944 |
58
- | 0.3565 | 0.2088 | 52 | 0.3768 | 9232 |
59
- | 0.3087 | 0.2610 | 65 | 0.3713 | 11424 |
60
- | 0.3607 | 0.3133 | 78 | 0.3614 | 13760 |
61
- | 0.3589 | 0.3655 | 91 | 0.3609 | 16048 |
62
- | 0.2898 | 0.4177 | 104 | 0.3723 | 18272 |
63
- | 0.4246 | 0.4699 | 117 | 0.3699 | 20656 |
64
- | 0.3657 | 0.5221 | 130 | 0.3523 | 23056 |
65
- | 0.3637 | 0.5743 | 143 | 0.3551 | 25312 |
66
- | 0.3938 | 0.6265 | 156 | 0.3517 | 27552 |
67
- | 0.3198 | 0.6787 | 169 | 0.3546 | 29984 |
68
- | 0.369 | 0.7309 | 182 | 0.3491 | 32080 |
69
- | 0.3673 | 0.7831 | 195 | 0.3541 | 34176 |
70
- | 0.3675 | 0.8353 | 208 | 0.3513 | 36512 |
71
- | 0.3634 | 0.8876 | 221 | 0.3547 | 38912 |
72
- | 0.3446 | 0.9398 | 234 | 0.3519 | 41120 |
73
- | 0.3364 | 0.9920 | 247 | 0.3516 | 43600 |
74
 
75
 
76
  ### Framework versions
 
17
 
18
  # test
19
 
20
+ This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.5010
23
+ - Num Input Tokens Seen: 43600
24
 
25
  ## Model description
26
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Input Tokens Seen |
54
  |:-------------:|:------:|:----:|:---------------:|:-----------------:|
55
+ | 0.9316 | 0.0522 | 13 | 0.9549 | 2288 |
56
+ | 1.1199 | 0.1044 | 26 | 0.8822 | 4656 |
57
+ | 0.8317 | 0.1566 | 39 | 0.8176 | 6944 |
58
+ | 0.7882 | 0.2088 | 52 | 0.7668 | 9232 |
59
+ | 0.7909 | 0.2610 | 65 | 0.6973 | 11424 |
60
+ | 0.7007 | 0.3133 | 78 | 0.6643 | 13760 |
61
+ | 0.7416 | 0.3655 | 91 | 0.6244 | 16048 |
62
+ | 0.8212 | 0.4177 | 104 | 0.5990 | 18272 |
63
+ | 0.4927 | 0.4699 | 117 | 0.5652 | 20656 |
64
+ | 0.5708 | 0.5221 | 130 | 0.5375 | 23056 |
65
+ | 0.4855 | 0.5743 | 143 | 0.5332 | 25312 |
66
+ | 0.5239 | 0.6265 | 156 | 0.5173 | 27552 |
67
+ | 0.4772 | 0.6787 | 169 | 0.5134 | 29984 |
68
+ | 0.4958 | 0.7309 | 182 | 0.5051 | 32080 |
69
+ | 0.6547 | 0.7831 | 195 | 0.5062 | 34176 |
70
+ | 0.6246 | 0.8353 | 208 | 0.5012 | 36512 |
71
+ | 0.5174 | 0.8876 | 221 | 0.4947 | 38912 |
72
+ | 0.5318 | 0.9398 | 234 | 0.4977 | 41120 |
73
+ | 0.445 | 0.9920 | 247 | 0.5010 | 43600 |
74
 
75
 
76
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8a845aa3055a3b6c72a52119a58260b585c8c4a5038f6aae9b50044eba58f5db
3
  size 312947112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c1f41d907e743b9486bd6bdd45c5b3600a59054e65f165c52b7f8cf15e44577b
3
  size 312947112