theoldmandthesea/17k_business_book
Viewer • Updated • 17.5k • 28 • 15
How to use VijayRam1812/Mistral-7B-Business with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.1")
model = PeftModel.from_pretrained(base_model, "VijayRam1812/Mistral-7B-Business")This model is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.1 on the theoldmandthesea/17k_business_book dataset. It achieves the following results on the evaluation set:
Mainly intended to be used at the backend of a chatbot application meant to assist children and teenager with knowledge of business and finances.
Total number of samples: 17480
Number of Training Samples: 13984
Number of Validation / Test Samples: 3496
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.6197 | 0.1869 | 20 | 0.6099 |
| 0.5927 | 0.3738 | 40 | 0.5905 |
| 0.5664 | 0.5607 | 60 | 0.5824 |
| 0.572 | 0.7477 | 80 | 0.5767 |
| 0.5817 | 0.9346 | 100 | 0.5729 |