End of training

Files changed (4)

README.md CHANGED

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7291
 ## Model description
@@ -43,21 +43,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
-- num_epochs: 8
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.5764        | 1.0   | 755  | 0.6950          |
-| 0.5964        | 2.0   | 1510 | 0.6837          |
-| 0.5912        | 3.0   | 2265 | 0.6830          |
-| 0.6013        | 4.0   | 3020 | 0.6878          |
-| 0.5122        | 5.0   | 3775 | 0.6994          |
-| 0.4787        | 6.0   | 4530 | 0.7133          |
-| 0.4859        | 7.0   | 5285 | 0.7235          |
-| 0.4924        | 8.0   | 6040 | 0.7291          |
 ### Framework versions
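
The removed 8-epoch table shows validation loss reaching its minimum at epoch 3 and rising in every later epoch, while training loss drops from epoch 5 onward. The commit does not state why `num_epochs` was changed, so the following is only a sketch of how one might read the table: it finds the epoch with the lowest validation loss and lists the later epochs that did worse. The helper names `best_epoch` and `epochs_past_best` are hypothetical and not part of the repository.

```python
# Validation loss per epoch, copied from the removed (8-epoch) results table.
val_loss_8_epochs = {
    1: 0.6950, 2: 0.6837, 3: 0.6830, 4: 0.6878,
    5: 0.6994, 6: 0.7133, 7: 0.7235, 8: 0.7291,
}


def best_epoch(val_loss):
    """Return the epoch whose validation loss is lowest."""
    return min(val_loss, key=val_loss.get)


def epochs_past_best(val_loss):
    """Return epochs after the best one whose validation loss is higher than the best."""
    best = best_epoch(val_loss)
    return [e for e in sorted(val_loss) if e > best and val_loss[e] > val_loss[best]]


best = best_epoch(val_loss_8_epochs)
print("best epoch:", best, "validation loss:", val_loss_8_epochs[best])
print("later epochs with higher validation loss:", epochs_past_best(val_loss_8_epochs))
```

A pattern like this (validation loss climbing while training loss falls) is commonly associated with overfitting, which would be consistent with a shorter run, though that interpretation is an assumption here.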

 This model is a fine-tuned version of [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6810
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
+- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.5762        | 1.0   | 755  | 0.6938          |
+| 0.5946        | 2.0   | 1510 | 0.6818          |
+| 0.5896        | 3.0   | 2265 | 0.6801          |
+| 0.6087        | 4.0   | 3020 | 0.6810          |
 ### Framework versions
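
To illustrate what `lr_scheduler_type: cosine` with `lr_scheduler_warmup_ratio: 0.03` could look like over the 3020 steps of the 4-epoch run, here is a minimal sketch of a linear-warmup plus cosine-decay multiplier. It returns a factor between 0 and 1 that would scale the peak learning rate (the peak value itself is not visible in this diff, so it is left out). The function name `cosine_with_warmup` and the choice to round warmup steps up are assumptions, not taken from the training code.

```python
import math


def cosine_with_warmup(step, total_steps, warmup_ratio):
    """Learning-rate multiplier: linear warmup, then half-cosine decay to zero."""
    # Assumption: the number of warmup steps is the ratio times total steps, rounded up.
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


TOTAL_STEPS = 3020   # final step in the 4-epoch results table
WARMUP_RATIO = 0.03  # lr_scheduler_warmup_ratio from the model card

# Multiplier at the end of each epoch (755 steps per epoch in the table).
for epoch in range(1, 5):
    step = 755 * epoch
    print(epoch, step, round(cosine_with_warmup(step, TOTAL_STEPS, WARMUP_RATIO), 4))
```

Under these assumptions the schedule is stretched over whatever total step count the run uses, so halving `num_epochs` also compresses the decay: the multiplier reaches zero at step 3020 rather than step 6040.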

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9480a32629005365e8819af7727ad5a24576b7951d949e7aa5340e20bf9fa669
 size 1182877280

 version https://git-lfs.github.com/spec/v1
+oid sha256:30d368f259a32611a18a92fb97b8039733bbc0e539c81108ebe0ab32cc5d9306
 size 1182877280
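
The `adapter_model.safetensors` entries above are Git LFS pointer files (three `key value` lines: `version`, `oid`, `size`) rather than the weights themselves. As a rough sketch, the pointers can be parsed and compared to confirm that the content hash changed while the byte size stayed the same. The helper `parse_lfs_pointer` is a hypothetical name introduced for illustration.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the old and new sides of the diff.
old = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:9480a32629005365e8819af7727ad5a24576b7951d949e7aa5340e20bf9fa669
size 1182877280""")

new = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:30d368f259a32611a18a92fb97b8039733bbc0e539c81108ebe0ab32cc5d9306
size 1182877280""")

content_changed = old["digest"] != new["digest"]
same_size = old["size"] == new["size"]
print("content changed:", content_changed, "| same size:", same_size)
```

An unchanged size with a different digest would be expected if the adapter keeps the same tensor shapes and only the weight values differ, although the diff alone does not confirm that.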

runs/Apr10_05-36-32_DESKTOP-5GR7SN9/events.out.tfevents.1712698598.DESKTOP-5GR7SN9.21868.4 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:cc37731cd8c024843dcbca7ee03a49d65d13e32de8308ddc37c167faa0190e79
+size 31794

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:58a7a7afc26749fc081aac48feb01b6764c990bb8e9b9aff9b77bda2d2a4826b
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:cd5cbf31b9472a65764e0bff24a1d50c92ca6541e472ea4f794d6fee281924a2
 size 4920