Text Generation
PEFT
Safetensors
English
dialogue
gricean-maxims
cooperative-communication
lora
dpo
direct-preference-optimization
gpt2
nlp
Eval Results (legacy)
Instructions to use Pushkar27/GriceBench-DPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Pushkar27/GriceBench-DPO with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("gpt2-medium") model = PeftModel.from_pretrained(base_model, "Pushkar27/GriceBench-DPO") - Notebooks
- Google Colab
- Kaggle