ninagroot committed on
Commit 58c2d58 · verified · 1 Parent(s): fd92771

ninagroot/Llama-360Mtest

README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 4.3475
+ - Loss: 5.2706

  ## Model description

@@ -48,21 +48,21 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 8.4018 | 0.99 | 44 | 8.2779 |
- | 7.5564 | 1.99 | 88 | 7.2124 |
- | 6.742 | 2.98 | 132 | 6.4726 |
- | 6.0531 | 4.0 | 177 | 5.8848 |
- | 5.1195 | 4.99 | 221 | 5.2837 |
- | 4.5893 | 5.99 | 265 | 4.8101 |
- | 4.3185 | 6.98 | 309 | 4.6188 |
- | 4.0957 | 8.0 | 354 | 4.4767 |
- | 3.7674 | 8.99 | 398 | 4.4084 |
- | 3.6238 | 9.99 | 442 | 4.3695 |
- | 3.5106 | 10.98 | 486 | 4.3419 |
- | 3.2515 | 12.0 | 531 | 4.3291 |
- | 3.0916 | 12.99 | 575 | 4.3472 |
- | 3.0072 | 13.99 | 619 | 4.3490 |
- | 3.0306 | 14.92 | 660 | 4.3475 |
+ | 8.4154 | 0.99 | 44 | 8.2674 |
+ | 7.3733 | 1.98 | 88 | 7.2246 |
+ | 6.4378 | 3.0 | 133 | 6.5650 |
+ | 5.5786 | 3.99 | 177 | 6.1513 |
+ | 4.8345 | 4.98 | 221 | 5.7858 |
+ | 4.3034 | 5.99 | 266 | 5.4541 |
+ | 4.019 | 6.99 | 310 | 5.2054 |
+ | 3.5206 | 8.0 | 355 | 5.0984 |
+ | 3.0144 | 8.99 | 399 | 5.0603 |
+ | 2.6052 | 9.98 | 443 | 5.0552 |
+ | 2.2063 | 11.0 | 488 | 5.1439 |
+ | 1.7308 | 11.99 | 532 | 5.1838 |
+ | 1.4794 | 12.98 | 576 | 5.2275 |
+ | 1.2218 | 13.99 | 621 | 5.2608 |
+ | 1.1556 | 14.87 | 660 | 5.2706 |


  ### Framework versions
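
In the updated table (the `+` rows), validation loss reaches its lowest value partway through training and then rises while training loss keeps falling, a pattern consistent with overfitting. The sketch below picks out the lowest-validation-loss row and converts the losses to perplexity. It assumes the reported loss is a mean per-token cross-entropy in nats, which the README does not state; the `rows` list and the `best_row` helper are illustrative names, with values copied from the table.

```python
import math

# (epoch, step, training_loss, validation_loss), copied from the updated README table.
rows = [
    (0.99, 44, 8.4154, 8.2674),
    (1.98, 88, 7.3733, 7.2246),
    (3.0, 133, 6.4378, 6.5650),
    (3.99, 177, 5.5786, 6.1513),
    (4.98, 221, 4.8345, 5.7858),
    (5.99, 266, 4.3034, 5.4541),
    (6.99, 310, 4.019, 5.2054),
    (8.0, 355, 3.5206, 5.0984),
    (8.99, 399, 3.0144, 5.0603),
    (9.98, 443, 2.6052, 5.0552),
    (11.0, 488, 2.2063, 5.1439),
    (11.99, 532, 1.7308, 5.1838),
    (12.98, 576, 1.4794, 5.2275),
    (13.99, 621, 1.2218, 5.2608),
    (14.87, 660, 1.1556, 5.2706),
]


def best_row(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda r: r[3])


best = best_row(rows)
final = rows[-1]

# Assumption: loss is mean cross-entropy in nats, so perplexity = exp(loss).
print(f"best step: {best[1]} (epoch {best[0]}), val loss {best[3]:.4f}, "
      f"perplexity {math.exp(best[3]):.1f}")
print(f"final step: {final[1]}, val loss {final[3]:.4f}, "
      f"perplexity {math.exp(final[3]):.1f}")
print(f"validation loss increase after the best step: {final[3] - best[3]:.4f}")
```

If checkpoints were saved per epoch, the row returned by `best_row` would be the natural candidate to keep instead of the final one; whether such checkpoints exist is not shown in this commit.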
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:70a56446823a6b9000e94343024eda185ec2de87f84c064ed04df281a8c996f0
+ oid sha256:8964bc33b0bce23ca552b07f763bc59b0b28176fe18c016c02363dcea9199c3c
  size 1344172280
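
The `model.safetensors` entry is a Git LFS pointer: only the `oid` changes while `size` stays the same, so the weights were replaced by a file of identical length. As a minimal sketch, a downloaded file could be checked against such a pointer like this. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any Git LFS tooling, and the local file path is whatever you downloaded.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 named in `pointer`."""
    if pointer["algo"] != "sha256" or os.path.getsize(path) != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in chunks so a file over 1 GB does not have to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["digest"]


# Pointer text for the new model.safetensors, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:8964bc33b0bce23ca552b07f763bc59b0b28176fe18c016c02363dcea9199c3c\n"
    "size 1344172280\n"
)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("model.safetensors", pointer)` on a local copy would then report whether it corresponds to this commit rather than the parent.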
runs/Mar20_15-28-58_gcn7.local.snellius.surf.nl/events.out.tfevents.1710944950.gcn7.local.snellius.surf.nl.1480103.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:016d94e8143710e80b020cf8599903d062f6cb1afb58006f48ba8a13f6ed787e
+ size 13889
tokenizer_config.json CHANGED
@@ -37,7 +37,7 @@
  "bos_token": "<s>",
  "clean_up_tokenization_spaces": true,
  "eos_token": "</s>",
- "model_max_length": 128,
+ "model_max_length": 100,
  "pad_token": "<pad>",
  "tokenizer_class": "GPT2Tokenizer",
  "unk_token": "<|endoftext|>"
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:438d09597f6c502b8b3134429dc8a3ce76adbac05dab665a370b5b862e1699ee
+ oid sha256:f2b624f97bbf017a411c0d68c58ad13a58993c06d08892fa70debc40d6021e32
  size 4728