tatsu-lab/alpaca
Viewer • Updated • 52k • 102k • 985
How to use 4x32/mistral-7b-alpaca-qlora with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")
model = PeftModel.from_pretrained(base_model, "4x32/mistral-7b-alpaca-qlora")Fine-tuned version of mistralai/Mistral-7B-v0.1 using QLoRA on the Alpaca dataset.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch
tokenizer = AutoTokenizer.from_pretrained("4x32/mistral-7b-alpaca-qlora")
base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1", torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base, "4x32/mistral-7b-alpaca-qlora")
prompt = (
"Below is an instruction that describes a task. "
"Write a response that appropriately completes the request.\n\n"
"### Instruction:\nExplain what a neural network is.\n\n"
"### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Base model
mistralai/Mistral-7B-v0.1