jeffliulab's picture
Initial deploy
b48305f verified

A newer version of the Gradio SDK is available: 6.12.0

Upgrade
metadata
title: DPO Preference Voting
emoji: 🗳️
colorFrom: yellow
colorTo: green
sdk: gradio
sdk_version: 5.29.0
app_file: app.py
pinned: false
license: mit

DPO Preference Voting

Label preference pairs just like human annotators do for DPO training.

Course: 380 LLM Post-training ch4 — DPO