Chat-Error/Kimiko-Mistral-7B

nRuaif committed on Sep 30, 2023

Commit 416e9d2 · 1 Parent(s): 6c3a385

Update README.md

Files changed (1)

README.md +5 -8

@@ -4,7 +4,7 @@ base_model: mistralai/Mistral-7B-v0.1
 tags:
 - generated_from_trainer
 model-index:
-- name: aesir-rpg-mistral-out
+- name: Kimiko-Mistral-7B
   results: []
 ---
@@ -12,7 +12,7 @@ model-index:
 should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
-# aesir-rpg-mistral-out
+# Kimiko-Mistral-7B
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
@@ -20,22 +20,19 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
+Same dataset as Kimiko-v2 but on new model. THIS IS NOT TRAIN ON V3 DATASET
 ## Intended uses & limitations
-More information needed
+As a finetuning experiment on new 7B model. You can use this for roleplay or as an assistant
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0001
+- learning_rate: 0.00005
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42