ninagroot/Llama-360Mtest

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 6.8777
 ## Model description
@@ -48,20 +48,20 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 0.89  | 5    | 8.5719          |
-| No log        | 1.96  | 11   | 8.5382          |
-| No log        | 2.84  | 16   | 8.4906          |
-| 8.522         | 3.91  | 22   | 8.4103          |
-| 8.522         | 4.98  | 28   | 8.3006          |
-| 8.522         | 5.87  | 33   | 8.1833          |
-| 8.522         | 6.93  | 39   | 8.0064          |
-| 8.155         | 8.0   | 45   | 7.7918          |
-| 8.155         | 8.89  | 50   | 7.6069          |
-| 8.155         | 9.96  | 56   | 7.3969          |
-| 7.4478        | 10.84 | 61   | 7.2374          |
-| 7.4478        | 11.91 | 67   | 7.0722          |
-| 7.4478        | 12.98 | 73   | 6.9248          |
-| 7.4478        | 13.33 | 75   | 6.8777          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 6.8937
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.89  | 5    | 8.5629          |
+| No log        | 1.96  | 11   | 8.5316          |
+| No log        | 2.84  | 16   | 8.4872          |
+| 8.5217        | 3.91  | 22   | 8.4115          |
+| 8.5217        | 4.98  | 28   | 8.3069          |
+| 8.5217        | 5.87  | 33   | 8.1945          |
+| 8.5217        | 6.93  | 39   | 8.0237          |
+| 8.174         | 8.0   | 45   | 7.8173          |
+| 8.174         | 8.89  | 50   | 7.6343          |
+| 8.174         | 9.96  | 56   | 7.4226          |
+| 7.4864        | 10.84 | 61   | 7.2617          |
+| 7.4864        | 11.91 | 67   | 7.0916          |
+| 7.4864        | 12.98 | 73   | 6.9408          |
+| 7.4864        | 13.33 | 75   | 6.8937          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b2919e3e0d4c1c2a7995e2d168dbbbe9d9735fa2dfcafbca0859cfbe166a54a0
 size 1344172280

 version https://git-lfs.github.com/spec/v1
+oid sha256:9540083c13f66af40a2d681a8cc3433b17aa1087a1ee83d06484297c0ccaad0a
 size 1344172280

runs/Mar21_13-46-04_gcn56.local.snellius.surf.nl/events.out.tfevents.1711025174.gcn56.local.snellius.surf.nl.2492762.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:e6a6a016138b1385ea1ed90574e01bf48c414c1606759958ae65a42081e0954a
+size 8852

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a2cf339ba3fdb47c3376a77817252105e4cdf0ca84f273bd2db9a83d619952a5
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:441af247b61748e0071ab6c73277ce2d0e45005e5cab77b93c7097303d530ab4
 size 4728