allenai/scitldr
Viewer • Updated • 9.69k • 996 • 35
How to use pkbiswas/Bloom-560m-Summarization-QLoRa with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")
model = PeftModel.from_pretrained(base_model, "pkbiswas/Bloom-560m-Summarization-QLoRa")This model is a fine-tuned version of bigscience/bloom-560m on the scitldr dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 2.8487 | 0.2510 | 500 | 2.9019 |
| 2.8069 | 0.5020 | 1000 | 2.8799 |
| 2.8195 | 0.7530 | 1500 | 2.8660 |
| 2.8024 | 1.0040 | 2000 | 2.8556 |
| 2.661 | 1.2550 | 2500 | 2.8637 |
| 2.6136 | 1.5060 | 3000 | 2.8608 |
| 2.5816 | 1.7570 | 3500 | 2.8578 |
Base model
bigscience/bloom-560m