ninagroot/Llama-360Mtest

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.2830
 ## Model description
@@ -41,17 +41,25 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 100
-- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 6.0238        | 1.0   | 69   | 5.9202          |
-| 5.2632        | 2.0   | 138  | 4.8940          |
-| 4.012         | 3.0   | 207  | 4.3873          |
-| 3.7681        | 4.0   | 276  | 4.2830          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.3046
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 100
+- num_epochs: 12
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 6.0662        | 1.0   | 69   | 5.8781          |
+| 5.1419        | 2.0   | 138  | 4.9025          |
+| 4.1271        | 3.0   | 207  | 4.4935          |
+| 3.8908        | 4.0   | 276  | 4.3523          |
+| 3.5293        | 5.0   | 345  | 4.2722          |
+| 3.322         | 6.0   | 414  | 4.2443          |
+| 2.8975        | 7.0   | 483  | 4.2451          |
+| 2.6264        | 8.0   | 552  | 4.2609          |
+| 2.346         | 9.0   | 621  | 4.2915          |
+| 1.9401        | 10.0  | 690  | 4.2793          |
+| 1.7366        | 11.0  | 759  | 4.3004          |
+| 1.676         | 12.0  | 828  | 4.3046          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:17e02de6cf127178a8946722392615d442e0df7968a1e9ab13a0d4d88d6e43af
 size 1344172280

 version https://git-lfs.github.com/spec/v1
+oid sha256:4486339afec1cf614855ff7c7c981cd6604026e99f59f47a8c5557e39d874a23
 size 1344172280

runs/Mar22_13-12-25_gcn31.local.snellius.surf.nl/events.out.tfevents.1711109557.gcn31.local.snellius.surf.nl.998404.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:617e4d7c87c4cf8d30845b05a998fc1900e1355204f888e45cc22bbf49d70ae1
+size 14339

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:73705c12fe404124335ca72adf269875cc3fb90a1d3dea5008ffa660c0f4f512
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:92d7d7ba48657e43574e3d1deedfdd10bdc5b34e7b372b3aec112b1d0ce75abd
 size 4728