bitext/Bitext-customer-support-llm-chatbot-training-dataset
How to use anezatra/zephyr-7B-alpha-GPTQ with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the GPTQ-quantized base model, then attach the fine-tuned PEFT adapter to it.
base_model = AutoModelForCausalLM.from_pretrained("TheBloke/zephyr-7B-alpha-GPTQ")
model = PeftModel.from_pretrained(base_model, "anezatra/zephyr-7B-alpha-GPTQ")

Large language models have achieved remarkable success in natural language processing (NLP). However, because they are usually trained as general-purpose models, they may not perform optimally on a specific task. Fine-tuning a large model for the task at hand is therefore common practice. In this article, we walk through the process of fine-tuning and adapting Zephyr-7B-alpha-GPTQ, a large language model, for a particular task.
The following hyperparameters were used during training:
Base model: mistralai/Mistral-7B-v0.1