rbelanec/test

@@ -17,10 +17,10 @@ should probably proofread and complete it, then remove this comment. -->
 # test
-This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the wsc dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3491
-- Num Input Tokens Seen: 43904
 ## Model description
@@ -52,25 +52,25 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Input Tokens Seen |
 |:-------------:|:------:|:----:|:---------------:|:-----------------:|
-| 0.7462        | 0.0522 | 13   | 0.6849          | 2288              |
-| 0.6639        | 0.1044 | 26   | 0.4557          | 4656              |
-| 0.3742        | 0.1566 | 39   | 0.3849          | 6944              |
-| 0.3565        | 0.2088 | 52   | 0.3768          | 9232              |
-| 0.3087        | 0.2610 | 65   | 0.3713          | 11424             |
-| 0.3607        | 0.3133 | 78   | 0.3614          | 13760             |
-| 0.3589        | 0.3655 | 91   | 0.3609          | 16048             |
-| 0.2898        | 0.4177 | 104  | 0.3723          | 18272             |
-| 0.4246        | 0.4699 | 117  | 0.3699          | 20656             |
-| 0.3657        | 0.5221 | 130  | 0.3523          | 23056             |
-| 0.3637        | 0.5743 | 143  | 0.3551          | 25312             |
-| 0.3938        | 0.6265 | 156  | 0.3517          | 27552             |
-| 0.3198        | 0.6787 | 169  | 0.3546          | 29984             |
-| 0.369         | 0.7309 | 182  | 0.3491          | 32080             |
-| 0.3673        | 0.7831 | 195  | 0.3541          | 34176             |
-| 0.3675        | 0.8353 | 208  | 0.3513          | 36512             |
-| 0.3634        | 0.8876 | 221  | 0.3547          | 38912             |
-| 0.3446        | 0.9398 | 234  | 0.3519          | 41120             |
-| 0.3364        | 0.9920 | 247  | 0.3516          | 43600             |
 ### Framework versions

 # test
+This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5010
+- Num Input Tokens Seen: 43600
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss | Input Tokens Seen |
 |:-------------:|:------:|:----:|:---------------:|:-----------------:|
+| 0.9316        | 0.0522 | 13   | 0.9549          | 2288              |
+| 1.1199        | 0.1044 | 26   | 0.8822          | 4656              |
+| 0.8317        | 0.1566 | 39   | 0.8176          | 6944              |
+| 0.7882        | 0.2088 | 52   | 0.7668          | 9232              |
+| 0.7909        | 0.2610 | 65   | 0.6973          | 11424             |
+| 0.7007        | 0.3133 | 78   | 0.6643          | 13760             |
+| 0.7416        | 0.3655 | 91   | 0.6244          | 16048             |
+| 0.8212        | 0.4177 | 104  | 0.5990          | 18272             |
+| 0.4927        | 0.4699 | 117  | 0.5652          | 20656             |
+| 0.5708        | 0.5221 | 130  | 0.5375          | 23056             |
+| 0.4855        | 0.5743 | 143  | 0.5332          | 25312             |
+| 0.5239        | 0.6265 | 156  | 0.5173          | 27552             |
+| 0.4772        | 0.6787 | 169  | 0.5134          | 29984             |
+| 0.4958        | 0.7309 | 182  | 0.5051          | 32080             |
+| 0.6547        | 0.7831 | 195  | 0.5062          | 34176             |
+| 0.6246        | 0.8353 | 208  | 0.5012          | 36512             |
+| 0.5174        | 0.8876 | 221  | 0.4947          | 38912             |
+| 0.5318        | 0.9398 | 234  | 0.4977          | 41120             |
+| 0.445         | 0.9920 | 247  | 0.5010          | 43600             |
 ### Framework versions

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8a845aa3055a3b6c72a52119a58260b585c8c4a5038f6aae9b50044eba58f5db
 size 312947112

 version https://git-lfs.github.com/spec/v1
+oid sha256:c1f41d907e743b9486bd6bdd45c5b3600a59054e65f165c52b7f8cf15e44577b
 size 312947112
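
The pointer above changes only its `oid`, while `size` stays at 312947112 bytes, so a size check alone cannot tell the old and new adapter weights apart. As a minimal sketch, assuming a locally downloaded copy of `adapter_model.safetensors`, the pointer fields can be parsed and compared against the file's actual size and SHA-256 digest. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS or Hugging Face tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at 'path' has the size and digest named by the pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text copied from the new side of the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:c1f41d907e743b9486bd6bdd45c5b3600a59054e65f165c52b7f8cf15e44577b
size 312947112
"""
pointer = parse_lfs_pointer(POINTER)
```

With a local copy in place, `matches_pointer("adapter_model.safetensors", pointer)` would indicate whether the download corresponds to the new revision; the path here is an assumption about where the file was saved.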