allenai/scitldr
How to use pkbiswas/DeepSeek-R1-Distill-Llama-8B-Summarization-QLoRa with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Llama-8B")
model = PeftModel.from_pretrained(base_model, "pkbiswas/DeepSeek-R1-Distill-Llama-8B-Summarization-QLoRa")

This model is a fine-tuned version of deepseek-ai/DeepSeek-R1-Distill-Llama-8B on the scitldr dataset. Its results on the evaluation set are reported as validation loss in the table below.
DeepSeek-R1-Distill-Llama-8B fine-tuned for summarization of scientific documents
Summarization
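As a rough sketch of the summarization use case, the adapter can be loaded as shown above and asked for a short summary. The card does not document the prompt template used during fine-tuning, so the `build_prompt` and `summarize` helpers, the prompt wording, and the `max_new_tokens` default below are all assumptions for illustration only.

```python
def build_prompt(document: str) -> str:
    """Hypothetical prompt template; the one used in training is not documented."""
    return (
        "Summarize the following scientific text in one sentence.\n\n"
        f"{document.strip()}\n\nSummary:"
    )


def summarize(document: str, max_new_tokens: int = 64) -> str:
    """Load the base model plus adapter and generate a summary (assumed settings)."""
    # Heavy imports stay inside the function so build_prompt can be used
    # without torch, transformers, or peft being loaded.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    base_id = "deepseek-ai/DeepSeek-R1-Distill-Llama-8B"
    adapter_id = "pkbiswas/DeepSeek-R1-Distill-Llama-8B-Summarization-QLoRa"

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()

    inputs = tokenizer(build_prompt(document), return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `summarize(abstract_text)` downloads the base model and the adapter on first use, so it needs enough memory for an 8B-parameter model. If the adapter was trained with a different prompt format, output quality with this template may be poor.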
More information needed
Training results (loss logged every 220 steps):
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 2.459 | 0.2209 | 220 | 2.4903 |
| 2.3971 | 0.4418 | 440 | 2.4720 |
| 2.3821 | 0.6627 | 660 | 2.4550 |
| 2.3665 | 0.8835 | 880 | 2.4392 |
| 2.3582 | 1.1044 | 1100 | 2.5203 |
| 1.7824 | 1.3253 | 1320 | 2.5360 |
| 1.7599 | 1.5462 | 1540 | 2.5486 |
| 1.7352 | 1.7671 | 1760 | 2.5404 |
| 1.7088 | 1.9880 | 1980 | 2.5393 |
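
One way to read the table above is to find the evaluation step with the lowest validation loss and convert that loss to perplexity. The sketch below assumes the reported loss is the mean per-token cross-entropy in nats, which the card does not state; the `rows` list simply copies the table values.

```python
import math

# (training_loss, epoch, step, validation_loss), copied from the table above
rows = [
    (2.459, 0.2209, 220, 2.4903),
    (2.3971, 0.4418, 440, 2.4720),
    (2.3821, 0.6627, 660, 2.4550),
    (2.3665, 0.8835, 880, 2.4392),
    (2.3582, 1.1044, 1100, 2.5203),
    (1.7824, 1.3253, 1320, 2.5360),
    (1.7599, 1.5462, 1540, 2.5486),
    (1.7352, 1.7671, 1760, 2.5404),
    (1.7088, 1.9880, 1980, 2.5393),
]

# Row with the lowest validation loss
best = min(rows, key=lambda r: r[3])
best_step, best_val_loss = best[2], best[3]

# Perplexity under the assumption that loss is mean cross-entropy in nats
best_perplexity = math.exp(best_val_loss)

# Gap between validation and training loss at the final logged step
final_train, _, final_step, final_val = rows[-1]
final_gap = final_val - final_train

print(f"lowest validation loss {best_val_loss} at step {best_step}")
print(f"perplexity at that step: {best_perplexity:.2f}")
print(f"validation minus training loss at step {final_step}: {final_gap:.4f}")
```

In this table the validation loss is lowest at step 880 and higher at every later evaluation, while the training loss keeps falling. That pattern is consistent with overfitting during the second epoch, so a checkpoint from near the end of the first epoch may be worth comparing against the final one.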
Base model
deepseek-ai/DeepSeek-R1-Distill-Llama-8B