ninagroot commited on
Commit
5e652fe
·
verified ·
1 Parent(s): 8ffb0f6

ninagroot/babyllamatest

Browse files
Files changed (4) hide show
  1. README.md +21 -21
  2. config.json +1 -1
  3. model.safetensors +2 -2
  4. training_args.bin +1 -1
README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
15
  It achieves the following results on the evaluation set:
16
- - Loss: 3.9293
17
 
18
  ## Model description
19
 
@@ -46,26 +46,26 @@ The following hyperparameters were used during training:
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 217.7542 | 1.0 | 69 | 169.6833 |
50
- | 140.0595 | 2.0 | 138 | 102.3135 |
51
- | 64.7821 | 3.0 | 207 | 47.4496 |
52
- | 26.8103 | 4.0 | 276 | 19.3383 |
53
- | 12.782 | 5.0 | 345 | 12.0085 |
54
- | 9.8432 | 6.0 | 414 | 8.0061 |
55
- | 6.9448 | 7.0 | 483 | 6.6308 |
56
- | 6.1985 | 8.0 | 552 | 6.0272 |
57
- | 5.3316 | 9.0 | 621 | 5.6098 |
58
- | 4.7103 | 10.0 | 690 | 5.0774 |
59
- | 4.3456 | 11.0 | 759 | 4.8933 |
60
- | 4.1052 | 12.0 | 828 | 4.6336 |
61
- | 4.0201 | 13.0 | 897 | 4.4522 |
62
- | 3.7028 | 14.0 | 966 | 4.2817 |
63
- | 3.4861 | 15.0 | 1035 | 4.1521 |
64
- | 3.3937 | 16.0 | 1104 | 4.0707 |
65
- | 3.2937 | 17.0 | 1173 | 3.9879 |
66
- | 3.2748 | 18.0 | 1242 | 3.9467 |
67
- | 3.2268 | 19.0 | 1311 | 3.9353 |
68
- | 3.1461 | 20.0 | 1380 | 3.9293 |
69
 
70
 
71
  ### Framework versions
 
13
 
14
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
15
  It achieves the following results on the evaluation set:
16
+ - Loss: 3.4868
17
 
18
  ## Model description
19
 
 
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | 16.2182 | 1.0 | 393 | 13.7445 |
50
+ | 6.1598 | 2.0 | 786 | 6.7784 |
51
+ | 5.094 | 3.0 | 1179 | 5.4728 |
52
+ | 4.2823 | 4.0 | 1572 | 4.9842 |
53
+ | 3.7105 | 5.0 | 1965 | 4.6118 |
54
+ | 3.3325 | 6.0 | 2358 | 4.4379 |
55
+ | 3.1282 | 7.0 | 2751 | 4.2705 |
56
+ | 2.9706 | 8.0 | 3144 | 4.0921 |
57
+ | 2.8795 | 9.0 | 3537 | 3.9575 |
58
+ | 2.5869 | 10.0 | 3930 | 3.8738 |
59
+ | 2.6449 | 11.0 | 4323 | 3.8033 |
60
+ | 2.4537 | 12.0 | 4716 | 3.7222 |
61
+ | 2.4489 | 13.0 | 5109 | 3.6770 |
62
+ | 2.237 | 14.0 | 5502 | 3.6201 |
63
+ | 2.2934 | 15.0 | 5895 | 3.5597 |
64
+ | 2.2597 | 16.0 | 6288 | 3.5336 |
65
+ | 2.2667 | 17.0 | 6681 | 3.5108 |
66
+ | 2.2947 | 18.0 | 7074 | 3.4935 |
67
+ | 2.1618 | 19.0 | 7467 | 3.4894 |
68
+ | 2.2033 | 20.0 | 7860 | 3.4868 |
69
 
70
 
71
  ### Framework versions
config.json CHANGED
@@ -24,5 +24,5 @@
24
  "torch_dtype": "float32",
25
  "transformers_version": "4.39.1",
26
  "use_cache": true,
27
- "vocab_size": 4312
28
  }
 
24
  "torch_dtype": "float32",
25
  "transformers_version": "4.39.1",
26
  "use_cache": true,
27
+ "vocab_size": 32000
28
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8ccdb44254ef68be2db7f409e2555a42b7bf2111a213da449857b19375dce7e5
3
- size 185517896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18587d669e316d918d29fef1f1c52079e33a2b167f33c3ccd433d7cd84931186
3
+ size 298928096
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:72b13b0c38155039527a0899d7e7fd8f62fc72ce5bcb791e539c84eab61b13ab
3
  size 4984
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:42dba10b5115a01d6be8e65830dfd27ac82bb1251d44e3f697c99e87edbab09d
3
  size 4984