How to use osbm/llama-7b-4bit with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base LLaMA 7B checkpoint.
base_model = AutoModelForCausalLM.from_pretrained("baffo32/decapoda-research-llama-7B-hf")
# Wrap the base model with the PEFT weights from osbm/llama-7b-4bit.
model = PeftModel.from_pretrained(base_model, "osbm/llama-7b-4bit")
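Once the model is loaded, a typical next step is to generate text from a prompt. The following is a minimal sketch rather than an official recipe. The helper names `load_model_and_tokenizer` and `generate` are hypothetical. The sketch also assumes the base repository ships tokenizer files that `AutoTokenizer` can load, which the snippet above does not confirm.

```python
BASE_ID = "baffo32/decapoda-research-llama-7B-hf"
ADAPTER_ID = "osbm/llama-7b-4bit"


def load_model_and_tokenizer(base_id=BASE_ID, adapter_id=ADAPTER_ID):
    """Load the base model, attach the PEFT weights, and load a tokenizer."""
    # Imported here so the heavy dependencies are only needed when loading.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the base repo includes tokenizer files AutoTokenizer can read.
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()  # inference mode: disables dropout
    return model, tokenizer


def generate(model, tokenizer, prompt, max_new_tokens=32):
    """Tokenize a prompt, generate a continuation, and decode it to text."""
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # generate() returns a batch; decode the first (and only) sequence.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Usage (downloads several GB of weights on first run):
#   model, tokenizer = load_model_and_tokenizer()
#   print(generate(model, tokenizer, "The capital of France is"))
```

Loading is kept separate from generation so the model is downloaded and initialized once and then reused across prompts.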