---
title: DPO Preference Voting
emoji: 🗳️
colorFrom: yellow
colorTo: green
sdk: gradio
sdk_version: 5.29.0
app_file: app.py
pinned: false
license: mit
---
DPO Preference Voting
Label preference pairs just like human annotators do for DPO training.
Course: 380 LLM Post-training ch4 — DPO
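
To make the labeling step concrete, here is a minimal sketch of how one vote could become a preference record. It is not taken from this Space's `app.py`: the names `PreferencePair` and `record_vote` are hypothetical, and the `prompt` / `chosen` / `rejected` field names are an assumption based on a common convention for DPO datasets.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class PreferencePair:
    """One labeled example: the preferred and dispreferred response to a prompt."""
    prompt: str
    chosen: str
    rejected: str


def record_vote(prompt: str, response_a: str, response_b: str, vote: str) -> PreferencePair:
    """Turn an annotator's vote ("A" or "B") into a preference pair.

    Hypothetical helper for illustration only.
    """
    if vote == "A":
        return PreferencePair(prompt=prompt, chosen=response_a, rejected=response_b)
    if vote == "B":
        return PreferencePair(prompt=prompt, chosen=response_b, rejected=response_a)
    raise ValueError(f"vote must be 'A' or 'B', got {vote!r}")


if __name__ == "__main__":
    pair = record_vote(
        prompt="Explain DPO in one sentence.",
        response_a="DPO fine-tunes a model directly on preference pairs.",
        response_b="It is a kind of database.",
        vote="A",
    )
    # One JSON object per line is a convenient way to store many such records.
    print(json.dumps(asdict(pair)))
```

Each vote yields one record, so a labeling session produces a list of such pairs that can be written out as JSON lines for later training.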