DPO datasets
updated
Viewer
• Updated • 7.5k • 2.53k
• 173
argilla/distilabel-capybara-dpo-7k-binarized
Viewer
• Updated • 7.56k • 18.2k
• 183
llamafactory/DPO-En-Zh-20k
Viewer
• Updated • 20k • 528
• 102
argilla/distilabel-intel-orca-dpo-pairs
Viewer
• Updated • 12.9k • 23.1k
• 183
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated • 60.9k • 14.3k
• 162
argilla/distilabel-math-preference-dpo
Viewer
• Updated • 2.42k • 5.02k
• 88
M4-ai/prm_dpo_pairs_cleaned
Viewer
• Updated • 7.99k • 123
• 11
jondurbin/truthy-dpo-v0.1
Viewer
• Updated • 1.02k • 878
• 136
YeungNLP/ultrafeedback_binarized
Viewer
• Updated • 63.1k • 99
• 1
shibing624/DPO-En-Zh-20k-Preference
Viewer
• Updated • 20k • 203
• 18
Preview
• Updated • 125
• 6
mlabonne/orpo-dpo-mix-40k
Viewer
• Updated • 44.2k • 2.38k
• 302
Viewer
• Updated • 15.3k • 151
• 19
jondurbin/gutenberg-dpo-v0.1
Viewer
• Updated • 918 • 1.26k
• 164
CyberNative/Code_Vulnerability_Security_DPO
Viewer
• Updated • 4.66k • 4.94k
• 159
mlabonne/orpo-dpo-mix-40k-flat
Viewer
• Updated • 44.2k • 127
• 14
selimc/orpo-dpo-mix-TR-20k
Viewer
• Updated • 19.9k • 96
• 7
efederici/alpaca-vs-alpaca-orpo-dpo
Viewer
• Updated • 49.2k • 161
• 7
Viewer
• Updated • 2.42k • 382
• 10
allenai/llama-3.1-tulu-3-8b-preference-mixture
Viewer
• Updated • 273k • 5.65k
• 26
allenai/llama-3.1-tulu-3-70b-preference-mixture
Viewer
• Updated • 337k • 524
• 19
HuggingFaceH4/ultrafeedback_binarized
Viewer
• Updated • 187k • 16.7k
• 338
Preview
• Updated • 1.76k
• 215
allenai/llama-3.1-tulu-3-405b-preference-mixture
Viewer
• Updated • 361k • 91
• 6
Viewer
• Updated • 450k • 40.1k
• 751
qihoo360/Light-R1-DPOData
Viewer
• Updated • 2.97k • 202
• 29