LM Preference datas
updated
Viewer
• Updated
• 183k • 896
• 295
mlabonne/chatml_dpo_pairs
Viewer
• Updated
• 12.9k • 106
• 55
HuggingFaceH4/ultrachat_200k
Viewer
• Updated
• 515k • 38.1k
• 666
Viewer
• Updated
• 12.9k • 2.48k
• 320
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated
• 60.9k • 5.66k
• 161
argilla/distilabel-math-preference-dpo
Viewer
• Updated
• 2.42k • 317
• 88
PKU-Alignment/PKU-SafeRLHF
Viewer
• Updated
• 164k • 13.6k
• 178
lvwerra/stack-exchange-paired
Viewer
• Updated
• 31.3M • 3k
• 148
Viewer
• Updated
• 169k • 22.1k
• 1.68k
jondurbin/truthy-dpo-v0.1
Viewer
• Updated
• 1.02k • 824
• 136
Viewer
• Updated
• 2.02k • 73
• 15
Viewer
• Updated
• 445k • 824
• 101
Viewer
• Updated
• 37.1k • 1.56k
• 247
Viewer
• Updated
• 7.5k • 2.15k
• 171
Viewer
• Updated
• 1.11M • 16.3k
• 233
openbmb/UltraInteract_sft
Viewer
• Updated
• 289k • 523
• 126
allenai/olmo-2-0325-32b-preference-mix
Updated
• 122
• 15
PrimeIntellect/SYNTHETIC-2-Base-Answer-Critique
Viewer
• Updated
• 50k • 9
• 2