Commit ab1fb71 (verified), committed by ninagroot
1 Parent(s): 4ab7838

ninagroot/Llama-450Mtest

README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 5.3945
+ - Loss: 5.4098
 
  ## Model description
 
@@ -48,42 +48,42 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 8.6075 | 0.89 | 2 | 8.5462 |
- | 8.129 | 1.78 | 4 | 8.1922 |
- | 7.2799 | 2.67 | 6 | 7.6280 |
- | 6.3932 | 4.0 | 9 | 7.0045 |
- | 5.9328 | 4.89 | 11 | 6.7460 |
- | 5.5236 | 5.78 | 13 | 6.5483 |
- | 5.1214 | 6.67 | 15 | 6.2722 |
- | 4.615 | 8.0 | 18 | 6.0793 |
- | 4.1081 | 8.89 | 20 | 5.8698 |
- | 3.7709 | 9.78 | 22 | 5.7750 |
- | 3.3048 | 10.67 | 24 | 5.6209 |
- | 2.736 | 12.0 | 27 | 5.5056 |
- | 2.2953 | 12.89 | 29 | 5.4154 |
- | 1.8419 | 13.78 | 31 | 5.4703 |
- | 1.383 | 14.67 | 33 | 5.3935 |
- | 1.0662 | 16.0 | 36 | 5.4067 |
- | 0.7125 | 16.89 | 38 | 5.4292 |
- | 0.42 | 17.78 | 40 | 5.3973 |
- | 0.3069 | 18.67 | 42 | 5.4252 |
- | 0.2301 | 20.0 | 45 | 5.4153 |
- | 0.1218 | 20.89 | 47 | 5.4854 |
- | 0.1532 | 21.78 | 49 | 5.4369 |
- | 0.1541 | 22.67 | 51 | 5.4291 |
- | 0.1405 | 24.0 | 54 | 5.3642 |
- | 0.1382 | 24.89 | 56 | 5.4114 |
- | 0.1086 | 25.78 | 58 | 5.3732 |
- | 0.0788 | 26.67 | 60 | 5.3778 |
- | 0.0648 | 28.0 | 63 | 5.4038 |
- | 0.0545 | 28.89 | 65 | 5.3887 |
- | 0.0383 | 29.78 | 67 | 5.3867 |
- | 0.0294 | 30.67 | 69 | 5.3882 |
- | 0.0322 | 32.0 | 72 | 5.3921 |
- | 0.0295 | 32.89 | 74 | 5.3937 |
- | 0.0287 | 33.78 | 76 | 5.3943 |
- | 0.0257 | 34.67 | 78 | 5.3945 |
- | 0.0242 | 35.56 | 80 | 5.3945 |
+ | 8.5925 | 0.89 | 2 | 8.5285 |
+ | 8.0799 | 1.78 | 4 | 8.1514 |
+ | 7.2928 | 2.67 | 6 | 7.6051 |
+ | 6.4189 | 4.0 | 9 | 6.9819 |
+ | 5.9528 | 4.89 | 11 | 6.7342 |
+ | 5.5041 | 5.78 | 13 | 6.4977 |
+ | 5.1038 | 6.67 | 15 | 6.2781 |
+ | 4.6407 | 8.0 | 18 | 6.0155 |
+ | 4.1063 | 8.89 | 20 | 5.8248 |
+ | 3.7293 | 9.78 | 22 | 5.7090 |
+ | 3.2205 | 10.67 | 24 | 5.5755 |
+ | 2.628 | 12.0 | 27 | 5.4455 |
+ | 2.1521 | 12.89 | 29 | 5.4251 |
+ | 1.6848 | 13.78 | 31 | 5.4214 |
+ | 1.2621 | 14.67 | 33 | 5.3740 |
+ | 0.9489 | 16.0 | 36 | 5.4063 |
+ | 0.6234 | 16.89 | 38 | 5.4338 |
+ | 0.4226 | 17.78 | 40 | 5.4270 |
+ | 0.2883 | 18.67 | 42 | 5.4208 |
+ | 0.2553 | 20.0 | 45 | 5.3786 |
+ | 0.1331 | 20.89 | 47 | 5.4037 |
+ | 0.1511 | 21.78 | 49 | 5.3967 |
+ | 0.1605 | 22.67 | 51 | 5.4593 |
+ | 0.1609 | 24.0 | 54 | 5.3076 |
+ | 0.1078 | 24.89 | 56 | 5.3695 |
+ | 0.0951 | 25.78 | 58 | 5.3581 |
+ | 0.0639 | 26.67 | 60 | 5.3601 |
+ | 0.0635 | 28.0 | 63 | 5.3414 |
+ | 0.0576 | 28.89 | 65 | 5.3826 |
+ | 0.0385 | 29.78 | 67 | 5.4054 |
+ | 0.0316 | 30.67 | 69 | 5.4142 |
+ | 0.0317 | 32.0 | 72 | 5.4146 |
+ | 0.0286 | 32.89 | 74 | 5.4126 |
+ | 0.0266 | 33.78 | 76 | 5.4109 |
+ | 0.0248 | 34.67 | 78 | 5.4100 |
+ | 0.0245 | 35.56 | 80 | 5.4098 |
 
 
  ### Framework versions
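
The updated table reports a final evaluation loss of 5.4098, but that is not necessarily the lowest value reached during the run. The following is a minimal sketch for locating the best row in such a table; `parse_rows` and `EXCERPT` are hypothetical names introduced here, and the excerpt holds only six rows copied from the `+` side of the table above, not the full log.

```python
def parse_rows(table: str):
    """Parse a Markdown results table into (train_loss, epoch, step, val_loss) tuples."""
    rows = []
    for line in table.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        try:
            train, epoch, step, val = cells
            rows.append((float(train), float(epoch), int(step), float(val)))
        except ValueError:
            continue  # header row, separator row, or malformed line
    return rows


# Excerpt only: six rows copied from the updated ("+") table in this commit.
EXCERPT = """
| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 8.5925 | 0.89 | 2 | 8.5285 |
| 2.628 | 12.0 | 27 | 5.4455 |
| 1.2621 | 14.67 | 33 | 5.3740 |
| 0.1609 | 24.0 | 54 | 5.3076 |
| 0.0635 | 28.0 | 63 | 5.3414 |
| 0.0245 | 35.56 | 80 | 5.4098 |
"""

rows = parse_rows(EXCERPT)
best = min(rows, key=lambda r: r[3])  # row with the lowest validation loss
final = rows[-1]                      # last logged row

print(f"lowest validation loss {best[3]:.4f} at step {best[2]}")
print(f"final validation loss  {final[3]:.4f} at step {final[2]}")
```

In this excerpt the lowest validation loss appears at step 54, while training loss keeps falling through step 80 and validation loss stays near 5.4. That pattern is consistent with overfitting, although the commit itself does not say so.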
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:49b7e962a2a59f555c7e9f062e28b6e4288304ec12651692abed583bb3800e20
+ oid sha256:3c16a34e10f09d0ff7f245966497702d03b482c58ec1dee5abaa499de214fcde
  size 2875619784
runs/Apr22_11-39-27_gcn8.local.snellius.surf.nl/events.out.tfevents.1713778778.gcn8.local.snellius.surf.nl.3294509.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4a04ae9eb9f7432cbd29d97837d302c92afea9e40c777f80ca60853506623bd1
+ size 31046
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:dc2e563be5865c0f387aea22f10f997b477d9dcfcc8aa8b361fbf42ec71c7225
+ oid sha256:4152a7ba73c41ab9f750486ebb812b4c28ee0be7c4af28a7d653fc20761e5661
  size 4984
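
The binary files in this commit are stored as Git LFS pointers, each carrying a `version`, an `oid` (hash algorithm and digest), and a `size` in bytes. As a minimal sketch, assuming a downloaded file should match its pointer byte for byte, the check below compares size and SHA-256 digest. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS or Hugging Face API.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest recorded in the pointer."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in 1 MiB chunks so multi-gigabyte files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# Pointer text copied from the new training_args.bin entry in this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:4152a7ba73c41ab9f750486ebb812b4c28ee0be7c4af28a7d653fc20761e5661
size 4984
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("training_args.bin", pointer)` on a local copy would then report whether the download corresponds to this commit's version; the local path is an assumption.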