DPO datasets
updated
Viewer
• Updated
• 7.5k • 540
• 171
argilla/distilabel-capybara-dpo-7k-binarized
Viewer
• Updated
• 7.56k • 2.34k
• 182
llamafactory/DPO-En-Zh-20k
Viewer
• Updated
• 20k • 320
• 97
argilla/distilabel-intel-orca-dpo-pairs
Viewer
• Updated
• 12.9k • 4.73k
• 181
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated
• 60.9k • 5.27k
• 161
argilla/distilabel-math-preference-dpo
Viewer
• Updated
• 2.42k • 333
• 88
M4-ai/prm_dpo_pairs_cleaned
Viewer
• Updated
• 7.99k • 77
• 11
jondurbin/truthy-dpo-v0.1
Viewer
• Updated
• 1.02k • 796
• 136
YeungNLP/ultrafeedback_binarized
Viewer
• Updated
• 63.1k • 47
• 1
shibing624/DPO-En-Zh-20k-Preference
Viewer
• Updated
• 20k • 90
• 17
Preview
• Updated
• 78
• 6
mlabonne/orpo-dpo-mix-40k
Viewer
• Updated
• 44.2k • 340
• 300
Viewer
• Updated
• 15.3k • 47
• 19
jondurbin/gutenberg-dpo-v0.1
Viewer
• Updated
• 918 • 570
• 158
CyberNative/Code_Vulnerability_Security_DPO
Viewer
• Updated
• 4.66k • 642
• 148
mlabonne/orpo-dpo-mix-40k-flat
Viewer
• Updated
• 44.2k • 83
• 14
selimc/orpo-dpo-mix-TR-20k
Viewer
• Updated
• 19.9k • 34
• 7
efederici/alpaca-vs-alpaca-orpo-dpo
Viewer
• Updated
• 49.2k • 109
• 7
Viewer
• Updated
• 2.42k • 19
• 10
allenai/llama-3.1-tulu-3-8b-preference-mixture
Viewer
• Updated
• 273k • 1.98k
• 26
allenai/llama-3.1-tulu-3-70b-preference-mixture
Viewer
• Updated
• 337k • 1.63k
• 19
HuggingFaceH4/ultrafeedback_binarized
Viewer
• Updated
• 187k • 6.55k
• 324
Preview
• Updated
• 1.13k
• 208
allenai/llama-3.1-tulu-3-405b-preference-mixture
Viewer
• Updated
• 361k • 49
• 6
Viewer
• Updated
• 450k • 11k
• 716
qihoo360/Light-R1-DPOData
Viewer
• Updated
• 2.97k • 102
• 28