ninagroot/Llama-360Mtest

Browse files

Files changed (4) hide show

README.md +17 -18
model.safetensors +1 -1
runs/Mar20_16-10-51_gcn51.local.snellius.surf.nl/events.out.tfevents.1710947464.gcn51.local.snellius.surf.nl.1064152.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.3606
 ## Model description
@@ -33,11 +33,11 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
-- train_batch_size: 1
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 8
-- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 300
@@ -48,21 +48,20 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 8.3772        | 0.99  | 44   | 8.2600          |
-| 7.4813        | 1.99  | 88   | 7.1917          |
-| 6.7108        | 2.98  | 132  | 6.4281          |
-| 6.0293        | 4.0   | 177  | 5.8969          |
-| 5.1002        | 4.99  | 221  | 5.3150          |
-| 4.5767        | 5.99  | 265  | 4.8598          |
-| 4.3085        | 6.98  | 309  | 4.6251          |
-| 4.1028        | 8.0   | 354  | 4.4794          |
-| 3.765         | 8.99  | 398  | 4.4287          |
-| 3.6087        | 9.99  | 442  | 4.3823          |
-| 3.4937        | 10.98 | 486  | 4.3547          |
-| 3.2339        | 12.0  | 531  | 4.3534          |
-| 3.0636        | 12.99 | 575  | 4.3613          |
-| 2.9827        | 13.99 | 619  | 4.3627          |
-| 3.0038        | 14.92 | 660  | 4.3606          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 7.0554
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
+- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 8
+- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 300
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.89  | 5    | 8.5606          |
+| No log        | 1.96  | 11   | 8.5292          |
+| No log        | 2.84  | 16   | 8.4846          |
+| 8.4864        | 3.91  | 22   | 8.4073          |
+| 8.4864        | 4.98  | 28   | 8.3008          |
+| 8.4864        | 5.87  | 33   | 8.1839          |
+| 8.4864        | 6.93  | 39   | 8.0042          |
+| 7.9713        | 8.0   | 45   | 7.7971          |
+| 7.9713        | 8.89  | 50   | 7.6271          |
+| 7.9713        | 9.96  | 56   | 7.4406          |
+| 7.1329        | 10.84 | 61   | 7.3101          |
+| 7.1329        | 11.91 | 67   | 7.1784          |
+| 7.1329        | 12.98 | 73   | 7.0835          |
+| 7.1329        | 13.33 | 75   | 7.0554          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6bbf983db3a389a9896edd6e16758396cd5c8f0a0885175848b5f5b82ebbaef3
 size 1344172280

 version https://git-lfs.github.com/spec/v1
+oid sha256:e203949035322b93f96b32e33a3b46f8c77bd3c483c213fc723687fa1c693a6f
 size 1344172280

runs/Mar20_16-10-51_gcn51.local.snellius.surf.nl/events.out.tfevents.1710947464.gcn51.local.snellius.surf.nl.1064152.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b3169131d0782747d9292c36ca86717f670f03a6cd87545b712936dbda8e8204
+size 8852

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ff767715fe6d836b914073aef0c4ff6866ea24394c87468c1b6d8cf0a0f8e6d0
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:1cd45b93b103b11a2d592bcdfebeb34897c8c0f1a33e8d2c49fad0f8943a6900
 size 4728