LDJnr/Puffin
Viewer • Updated • 3k • 1.35k • 96
How to use tim-d/CurtGPT with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("teknium/Puffin-Phi-v2")
model = PeftModel.from_pretrained(base_model, "tim-d/CurtGPT")
CurtGPTUsing Microsoft's Phi 1.5 model like it was never intended. |
|
This model is an adapter on puffin phi v2 trained using QLoRA and DPO on 60,000 samples from the anthropic helpful only dataset.
The following bitsandbytes quantization config was used during training: