ninagroot/babyllamatest
Browse files- README.md +41 -41
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
|
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 13 |
|
| 14 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 15 |
It achieves the following results on the evaluation set:
|
| 16 |
-
- Loss: 12.
|
| 17 |
|
| 18 |
## Model description
|
| 19 |
|
|
@@ -46,46 +46,46 @@ The following hyperparameters were used during training:
|
|
| 46 |
|
| 47 |
| Training Loss | Epoch | Step | Validation Loss |
|
| 48 |
|:-------------:|:-----:|:----:|:---------------:|
|
| 49 |
-
| 81.
|
| 50 |
-
| 81.
|
| 51 |
-
| 78.
|
| 52 |
-
|
|
| 53 |
-
| 75.
|
| 54 |
-
|
|
| 55 |
-
|
|
| 56 |
-
|
|
| 57 |
-
| 64.
|
| 58 |
-
|
|
| 59 |
-
| 57.
|
| 60 |
-
|
|
| 61 |
-
|
|
| 62 |
-
| 48.
|
| 63 |
-
| 45.
|
| 64 |
-
| 44.
|
| 65 |
-
|
|
| 66 |
-
|
|
| 67 |
-
| 38.
|
| 68 |
-
|
|
| 69 |
-
| 34.
|
| 70 |
-
| 30.
|
| 71 |
-
| 29.
|
| 72 |
-
| 27.
|
| 73 |
-
| 25.
|
| 74 |
-
| 25.
|
| 75 |
-
| 24.
|
| 76 |
-
| 21.
|
| 77 |
-
| 20.
|
| 78 |
-
| 18.
|
| 79 |
-
| 17.
|
| 80 |
-
| 16.
|
| 81 |
-
| 16.
|
| 82 |
-
| 15.
|
| 83 |
-
| 14.
|
| 84 |
-
| 13.
|
| 85 |
-
| 13.
|
| 86 |
-
| 13.
|
| 87 |
-
| 13.
|
| 88 |
-
| 13.
|
| 89 |
|
| 90 |
|
| 91 |
### Framework versions
|
|
|
|
| 13 |
|
| 14 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 15 |
It achieves the following results on the evaluation set:
|
| 16 |
+
- Loss: 12.7407
|
| 17 |
|
| 18 |
## Model description
|
| 19 |
|
|
|
|
| 46 |
|
| 47 |
| Training Loss | Epoch | Step | Validation Loss |
|
| 48 |
|:-------------:|:-----:|:----:|:---------------:|
|
| 49 |
+
| 81.6068 | 1.0 | 2 | 75.8719 |
|
| 50 |
+
| 81.1403 | 2.0 | 4 | 74.6429 |
|
| 51 |
+
| 78.4746 | 3.0 | 6 | 72.5604 |
|
| 52 |
+
| 78.6147 | 4.0 | 8 | 69.6859 |
|
| 53 |
+
| 75.1485 | 5.0 | 10 | 67.9944 |
|
| 54 |
+
| 73.5182 | 6.0 | 12 | 64.5075 |
|
| 55 |
+
| 69.6393 | 7.0 | 14 | 61.3852 |
|
| 56 |
+
| 66.9895 | 8.0 | 16 | 58.5262 |
|
| 57 |
+
| 64.4746 | 9.0 | 18 | 55.6940 |
|
| 58 |
+
| 60.8097 | 10.0 | 20 | 52.6993 |
|
| 59 |
+
| 57.1714 | 11.0 | 22 | 49.5786 |
|
| 60 |
+
| 53.8474 | 12.0 | 24 | 46.5081 |
|
| 61 |
+
| 49.9873 | 13.0 | 26 | 43.6358 |
|
| 62 |
+
| 48.7366 | 14.0 | 28 | 41.0406 |
|
| 63 |
+
| 45.0539 | 15.0 | 30 | 38.7263 |
|
| 64 |
+
| 44.0504 | 16.0 | 32 | 36.6352 |
|
| 65 |
+
| 40.9533 | 17.0 | 34 | 34.6685 |
|
| 66 |
+
| 39.9931 | 18.0 | 36 | 32.7875 |
|
| 67 |
+
| 38.116 | 19.0 | 38 | 30.8567 |
|
| 68 |
+
| 35.4181 | 20.0 | 40 | 28.9705 |
|
| 69 |
+
| 34.0383 | 21.0 | 42 | 27.4282 |
|
| 70 |
+
| 30.7991 | 22.0 | 44 | 26.4171 |
|
| 71 |
+
| 29.8348 | 23.0 | 46 | 24.9225 |
|
| 72 |
+
| 27.9282 | 24.0 | 48 | 23.9103 |
|
| 73 |
+
| 25.8511 | 25.0 | 50 | 22.9495 |
|
| 74 |
+
| 25.1711 | 26.0 | 52 | 21.5530 |
|
| 75 |
+
| 24.2361 | 27.0 | 54 | 20.5871 |
|
| 76 |
+
| 21.9294 | 28.0 | 56 | 19.0727 |
|
| 77 |
+
| 20.435 | 29.0 | 58 | 18.0482 |
|
| 78 |
+
| 18.682 | 30.0 | 60 | 17.0037 |
|
| 79 |
+
| 17.4144 | 31.0 | 62 | 16.0468 |
|
| 80 |
+
| 16.4872 | 32.0 | 64 | 15.2828 |
|
| 81 |
+
| 16.2417 | 33.0 | 66 | 14.6359 |
|
| 82 |
+
| 15.1244 | 34.0 | 68 | 14.1234 |
|
| 83 |
+
| 14.0602 | 35.0 | 70 | 13.5799 |
|
| 84 |
+
| 13.7722 | 36.0 | 72 | 13.3509 |
|
| 85 |
+
| 13.377 | 37.0 | 74 | 12.9960 |
|
| 86 |
+
| 13.4091 | 38.0 | 76 | 12.8183 |
|
| 87 |
+
| 13.1398 | 39.0 | 78 | 12.7614 |
|
| 88 |
+
| 13.1002 | 40.0 | 80 | 12.7407 |
|
| 89 |
|
| 90 |
|
| 91 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 217819016
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b95b266c906f8a10c9a06b7ef00f57ba328d35b02576511cc7d51b8ef129717f
|
| 3 |
size 217819016
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4984
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6d49873d6da6e46a5804e20834fe83e4d20fbe84a83bb526ea7410b0ff377414
|
| 3 |
size 4984
|