argilla/dpo-mix-7k
Viewer • Updated • 7.5k • 2.2k • 173
How to use eren23/DPOMixLLama-3-8B-lora with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")
model = PeftModel.from_pretrained(base_model, "eren23/DPOMixLLama-3-8B-lora")A finetuning experiment on llama3 8b it with selected 5k examples from argilla dpo 7k
Base model
meta-llama/Meta-Llama-3-8B-Instruct