End of training
Browse files
- README.md +30 -1
- adapter_model.bin +2 -2

README.md
CHANGED
@@ -2,6 +2,7 @@
 license: apache-2.0
 library_name: peft
 tags:
+- axolotl
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
@@ -117,7 +118,9 @@ weight_decay: 0
 
 # mixtral-lora
 
-This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4469
 
 ## Model description
 
@@ -150,6 +153,32 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 2
 
+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 3.3559        | 0.0   | 1    | 3.6627          |
+| 0.548         | 0.1   | 43   | 0.5124          |
+| 0.2747        | 0.2   | 86   | 0.4845          |
+| 0.4202        | 0.31  | 129  | 0.4760          |
+| 0.4662        | 0.41  | 172  | 0.4690          |
+| 0.4605        | 0.51  | 215  | 0.4640          |
+| 0.2909        | 0.61  | 258  | 0.4620          |
+| 0.3941        | 0.71  | 301  | 0.4600          |
+| 0.4185        | 0.82  | 344  | 0.4573          |
+| 0.395         | 0.92  | 387  | 0.4558          |
+| 0.2725        | 1.0   | 430  | 0.4534          |
+| 0.2789        | 1.1   | 473  | 0.4525          |
+| 0.4126        | 1.21  | 516  | 0.4511          |
+| 0.3277        | 1.31  | 559  | 0.4506          |
+| 0.3591        | 1.41  | 602  | 0.4493          |
+| 0.3665        | 1.51  | 645  | 0.4487          |
+| 0.5551        | 1.62  | 688  | 0.4471          |
+| 0.3363        | 1.72  | 731  | 0.4473          |
+| 0.3117        | 1.82  | 774  | 0.4471          |
+| 0.496         | 1.92  | 817  | 0.4469          |
+
+
 ### Framework versions
 
 - PEFT 0.8.2
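The training-results section added by this commit is a plain Markdown table. A minimal sketch of reading the final validation loss back out of such a table (the column names are the card's; the parsing helper and its name are mine, and only the first and last data rows are reproduced here for brevity):

```python
# Parse a Markdown training-results table (as written by the trainer)
# and report the final validation loss. TABLE is quoted from the card,
# truncated to the first and last rows; parse_rows is illustrative.

TABLE = """\
| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 3.3559        | 0.0   | 1    | 3.6627          |
| 0.496         | 1.92  | 817  | 0.4469          |
"""

def parse_rows(table: str):
    """Yield one dict per data row, keyed by the header cells."""
    lines = [l for l in table.strip().splitlines() if l.startswith("|")]
    header = [c.strip() for c in lines[0].strip("|").split("|")]
    for line in lines[2:]:  # skip the header and the :---: separator row
        cells = [c.strip() for c in line.strip("|").split("|")]
        yield dict(zip(header, cells))

rows = list(parse_rows(TABLE))
final_loss = float(rows[-1]["Validation Loss"])
print(final_loss)  # 0.4469
```

The last row's validation loss (0.4469) matches the "Loss" figure the commit adds to the card's summary line.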
adapter_model.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:3acbe23bf41e60f8d5a30cb06a7c41a6d155e59af7c1563c4d945cf45e90c9b6
+size 109144714
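adapter_model.bin is stored via Git LFS, so the repository only holds the three-line pointer stub shown in the diff (version / oid / size). A small sketch of reading such a pointer; the pointer text is the commit's, while the helper function and the 2-bytes-per-parameter estimate are my own assumptions:

```python
# Read a Git LFS pointer file: the small text stub git stores in place
# of the real binary. Per the spec at https://git-lfs.github.com/spec/v1,
# each line is a "key value" pair separated by a single space.

POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3acbe23bf41e60f8d5a30cb06a7c41a6d155e59af7c1563c4d945cf45e90c9b6
size 109144714
"""

def parse_lfs_pointer(text: str) -> dict:
    """Return the pointer's key/value fields, with size as an int."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    fields["size"] = int(fields["size"])
    return fields

ptr = parse_lfs_pointer(POINTER)
print(ptr["size"])       # 109144714 (~109 MB adapter)
# Rough sanity check: at 2 bytes per parameter (fp16 -- an assumption,
# the card does not state the save dtype), that is ~54.5M LoRA params.
print(ptr["size"] // 2)  # 54572357
```

The size field confirms this commit uploads only the LoRA adapter weights, not the full Mixtral-8x7B checkpoint, which is why the file stays around 109 MB.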