DPO datasets
• (dataset name missing): 7.5k rows, 2.56k downloads, 173 likes
• argilla/distilabel-capybara-dpo-7k-binarized: 7.56k rows, 18.3k downloads, 183 likes
• llamafactory/DPO-En-Zh-20k: 20k rows, 539 downloads, 102 likes
• argilla/distilabel-intel-orca-dpo-pairs: 12.9k rows, 23.3k downloads, 183 likes
• argilla/ultrafeedback-binarized-preferences-cleaned: 60.9k rows, 14.7k downloads, 162 likes
• argilla/distilabel-math-preference-dpo: 2.42k rows, 4.97k downloads, 88 likes
• M4-ai/prm_dpo_pairs_cleaned: 7.99k rows, 123 downloads, 11 likes
• jondurbin/truthy-dpo-v0.1: 1.02k rows, 878 downloads, 136 likes
• YeungNLP/ultrafeedback_binarized: 63.1k rows, 98 downloads, 1 like
• shibing624/DPO-En-Zh-20k-Preference: 20k rows, 206 downloads, 18 likes
• (dataset name missing): 129 downloads, 6 likes
• mlabonne/orpo-dpo-mix-40k: 44.2k rows, 2.34k downloads, 302 likes
• (dataset name missing): 15.3k rows, 153 downloads, 19 likes
• jondurbin/gutenberg-dpo-v0.1: 918 rows, 1.27k downloads, 164 likes
• CyberNative/Code_Vulnerability_Security_DPO: 4.66k rows, 5.06k downloads, 159 likes
• mlabonne/orpo-dpo-mix-40k-flat: 44.2k rows, 127 downloads, 14 likes
• selimc/orpo-dpo-mix-TR-20k: 19.9k rows, 96 downloads, 7 likes
• efederici/alpaca-vs-alpaca-orpo-dpo: 49.2k rows, 163 downloads, 7 likes
• (dataset name missing): 2.42k rows, 382 downloads, 10 likes
• allenai/llama-3.1-tulu-3-8b-preference-mixture: 273k rows, 5.86k downloads, 26 likes
• allenai/llama-3.1-tulu-3-70b-preference-mixture: 337k rows, 517 downloads, 19 likes
• HuggingFaceH4/ultrafeedback_binarized: 187k rows, 17k downloads, 338 likes
• (dataset name missing): 1.8k downloads, 215 likes
• allenai/llama-3.1-tulu-3-405b-preference-mixture: 361k rows, 91 downloads, 6 likes
• (dataset name missing): 450k rows, 41.3k downloads, 751 likes
• qihoo360/Light-R1-DPOData: 2.97k rows, 197 downloads, 29 likes