DPO datasets
updated
Viewer
• Updated • 7.5k • 1.77k
• 173
argilla/distilabel-capybara-dpo-7k-binarized
Viewer
• Updated • 7.56k • 18.1k
• 183
llamafactory/DPO-En-Zh-20k
Viewer
• Updated • 20k • 501
• 102
argilla/distilabel-intel-orca-dpo-pairs
Viewer
• Updated • 12.9k • 24.2k
• 183
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated • 60.9k • 15.3k
• 162
argilla/distilabel-math-preference-dpo
Viewer
• Updated • 2.42k • 4.23k
• 88
M4-ai/prm_dpo_pairs_cleaned
Viewer
• Updated • 7.99k • 134
• 11
jondurbin/truthy-dpo-v0.1
Viewer
• Updated • 1.02k • 916
• 136
YeungNLP/ultrafeedback_binarized
Viewer
• Updated • 63.1k • 94
• 1
shibing624/DPO-En-Zh-20k-Preference
Viewer
• Updated • 20k • 206
• 18
Preview
• Updated • 147
• 6
mlabonne/orpo-dpo-mix-40k
Viewer
• Updated • 44.2k • 1.73k
• 302
Viewer
• Updated • 15.3k • 148
• 19
jondurbin/gutenberg-dpo-v0.1
Viewer
• Updated • 918 • 1.08k
• 164
CyberNative/Code_Vulnerability_Security_DPO
Viewer
• Updated • 4.66k • 5.19k
• 159
mlabonne/orpo-dpo-mix-40k-flat
Viewer
• Updated • 44.2k • 121
• 14
selimc/orpo-dpo-mix-TR-20k
Viewer
• Updated • 19.9k • 112
• 7
efederici/alpaca-vs-alpaca-orpo-dpo
Viewer
• Updated • 49.2k • 166
• 7
Viewer
• Updated • 2.42k • 379
• 10
allenai/llama-3.1-tulu-3-8b-preference-mixture
Viewer
• Updated • 273k • 5.97k
• 26
allenai/llama-3.1-tulu-3-70b-preference-mixture
Viewer
• Updated • 337k • 499
• 19
HuggingFaceH4/ultrafeedback_binarized
Viewer
• Updated • 187k • 16.8k
• 338
Preview
• Updated • 1.67k
• 215
allenai/llama-3.1-tulu-3-405b-preference-mixture
Viewer
• Updated • 361k • 92
• 6
Viewer
• Updated • 450k • 45.3k
• 752
qihoo360/Light-R1-DPOData
Viewer
• Updated • 2.97k • 127
• 29