allenai/scitldr
Viewer • Updated • 9.69k • 1.08k • 35
How to use pkbiswas/Phi-3-Summarization-QLoRa with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-3-mini-128k-instruct")
model = PeftModel.from_pretrained(base_model, "pkbiswas/Phi-3-Summarization-QLoRa")This model is a fine-tuned version of microsoft/Phi-3-mini-128k-instruct on the scitldr dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 2.0689 | 0.2510 | 500 | 2.1439 |
| 2.0455 | 0.5020 | 1000 | 2.1388 |
| 2.0665 | 0.7530 | 1500 | 2.1349 |
| 2.0481 | 1.0040 | 2000 | 2.1308 |
| 1.9051 | 1.2550 | 2500 | 2.1573 |
| 1.8524 | 1.5060 | 3000 | 2.1588 |
| 1.8247 | 1.7570 | 3500 | 2.1554 |
Base model
microsoft/Phi-3-mini-128k-instruct