Reward Modeling Datasets
(dataset name missing) • 37.1k rows • 3.75k downloads • 243 likes
(dataset name missing) • 169k rows • 23.5k downloads • 1.56k likes
(dataset name missing) • 386k rows • 5.06k downloads • 320 likes
PKU-Alignment/PKU-SafeRLHF • 164k rows • 6.55k downloads • 168 likes
openai/webgpt_comparisons • 19.6k rows • 4.22k downloads • 238 likes
openai/summarize_from_feedback • 194k rows • 3.51k downloads • 215 likes
HuggingFaceH4/ultrafeedback_binarized • 187k rows • 9.26k downloads • 316 likes
(dataset name missing) • 183k rows • 677 downloads • 294 likes
HuggingFaceH4/stack-exchange-preferences • 10.8M rows • 3.13k downloads • 133 likes
HuggingFaceH4/hhh_alignment • 221 rows • 363 downloads • 21 likes
Birchlabs/openai-prm800k-stepwise-critic • 1.09M rows • 136 downloads • 45 likes
prometheus-eval/Feedback-Collection • 100k rows • 1.14k downloads • 116 likes
argilla/OpenHermesPreferences • 989k rows • 1.38k downloads • 210 likes
(dataset name missing) • 8.11k rows • 5.08k downloads • 103 likes
(dataset name missing) • 21.4k rows • 7.68k downloads • 434 likes
Magpie-Align/Magpie-Pro-DPO-200K • 207k rows • 23 downloads • 7 likes
argilla/magpie-ultra-v0.1 • 50k rows • 565 downloads • 221 likes