How to use saffr0n/ccai-sft-qlora with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base model that the adapter was trained on.
base_model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")
# Attach the fine-tuned adapter weights to the base model.
model = PeftModel.from_pretrained(base_model, "saffr0n/ccai-sft-qlora")

This model is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 on the HuggingFaceH4/cai-conversation-harmless dataset. It achieves a loss of 0.6977 on the evaluation set.
More information needed
The following hyperparameters were used during training:

Training results:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.6894 | 0.99 | 38 | 0.6977 |
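
The losses in the table are easier to interpret as perplexities. The following is a minimal sketch, assuming each reported loss is a mean per-token cross-entropy measured in nats (the usual convention for causal language model training, though this card does not state it). The variable names are illustrative only.

```python
import math

# Losses copied from the training results table (epoch 0.99, step 38).
training_loss = 0.6894
validation_loss = 0.6977

# Assumption: each loss is a mean per-token cross-entropy in nats,
# so the corresponding perplexity is exp(loss).
training_perplexity = math.exp(training_loss)
validation_perplexity = math.exp(validation_loss)

print(f"training perplexity:   {training_perplexity:.3f}")
print(f"validation perplexity: {validation_perplexity:.3f}")
```

Under that assumption, both values come out close to 2, and the small gap between training and validation loss suggests little overfitting after roughly one epoch.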
Base model
mistralai/Mistral-7B-Instruct-v0.2