allenai/scitldr
Viewer • Updated • 9.69k • 1.15k • 35
How to use pkbiswas/Mistral-7B-Summarization-QLoRa with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")
model = PeftModel.from_pretrained(base_model, "pkbiswas/Mistral-7B-Summarization-QLoRa")This model is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 on the scitldr dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 2.0732 | 0.1 | 200 | 2.1863 |
| 2.1324 | 0.2 | 400 | 2.1925 |
| 2.103 | 0.3 | 600 | 2.1876 |
| 2.0766 | 0.4 | 800 | 2.1737 |
| 2.0825 | 0.5 | 1000 | 2.1555 |
| 2.0731 | 0.6 | 1200 | 2.1465 |
| 2.0819 | 0.7 | 1400 | 2.1355 |
| 1.9802 | 0.8 | 1600 | 2.1223 |
| 2.0466 | 0.9 | 1800 | 2.1059 |
Base model
mistralai/Mistral-7B-Instruct-v0.2