liuylhf
/

mixtral-remove-negative-data

Generated from Trainer

4-bit precision

Model card Files Files and versions

liuylhf commited on Mar 5, 2024

Commit

7a6c7d4

·

verified ·

1 Parent(s): 0631ff4

End of training

Files changed (2) hide show

README.md +15 -1
adapter_model.bin +3 -0

README.md CHANGED Viewed

@@ -2,6 +2,7 @@
 license: apache-2.0
 library_name: peft
 tags:
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
@@ -92,7 +93,9 @@ xformers_attention: null
 # mixtral-remove-negative-data
-This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on an unknown dataset.
 ## Model description
@@ -125,6 +128,17 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 1
 ### Framework versions
 - PEFT 0.8.2

 license: apache-2.0
 library_name: peft
 tags:
+- axolotl
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
 # mixtral-remove-negative-data
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0936
 ## Model description
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 1
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 4.0791        | 0.0   | 1    | 4.0657          |
+| 0.1507        | 0.2   | 56   | 0.1291          |
+| 0.1178        | 0.4   | 112  | 0.1052          |
+| 0.1046        | 0.61  | 168  | 0.0977          |
+| 0.102         | 0.81  | 224  | 0.0936          |
 ### Framework versions
 - PEFT 0.8.2

adapter_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:64c92fbfd3a2017ede1b278af4af27f73e13f5760bdcded2ab11e65f916f95ab
+size 109144714