LM Preference datas
updated
Viewer
•
Updated
•
183k
•
700
•
295
mlabonne/chatml_dpo_pairs
Viewer
•
Updated
•
12.9k
•
78
•
54
HuggingFaceH4/ultrachat_200k
Viewer
•
Updated
•
515k
•
30.6k
•
643
Viewer
•
Updated
•
12.9k
•
1.69k
•
319
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
2.55k
•
159
argilla/distilabel-math-preference-dpo
Viewer
•
Updated
•
2.42k
•
642
•
88
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
•
164k
•
7.51k
•
173
lvwerra/stack-exchange-paired
Viewer
•
Updated
•
31.3M
•
710
•
147
Viewer
•
Updated
•
169k
•
21.6k
•
1.65k
jondurbin/truthy-dpo-v0.1
Viewer
•
Updated
•
1.02k
•
173
•
135
Viewer
•
Updated
•
2.02k
•
45
•
15
Viewer
•
Updated
•
445k
•
96
•
100
Viewer
•
Updated
•
37.1k
•
7.17k
•
245
Viewer
•
Updated
•
7.5k
•
251
•
170
Viewer
•
Updated
•
1.11M
•
12.6k
•
221
openbmb/UltraInteract_sft
Viewer
•
Updated
•
289k
•
527
•
126
allenai/olmo-2-0325-32b-preference-mix
Updated
•
143
•
15
PrimeIntellect/SYNTHETIC-2-Base-Answer-Critique
Viewer
•
Updated
•
50k
•
6
•
2