Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
guoxz
's Collections
game
kernel
multimodal_math
image
multimodal-reasoning
security
eval
o1_like
cot
agent
voice
rl
instruction_with_rationale
instruction
math
code
pretrain
Multimodality
med
func_call
role
law
rl
updated
6 days ago
Upvote
-
Skywork/Skywork-Reward-Preference-80K-v0.1
Viewer
•
Updated
Oct 25, 2024
•
82k
•
87
•
45
mlabonne/open-perfectblend
Viewer
•
Updated
Jan 15, 2025
•
1.42M
•
424
•
62
allenai/llama-3.1-tulu-3-70b-preference-mixture
Viewer
•
Updated
Feb 4, 2025
•
337k
•
146
•
19
OpenLeecher/lmsys_chat_1m_clean
Viewer
•
Updated
Dec 31, 2024
•
273k
•
201
•
82
opencsg/UltraFeedback-chinese
Preview
•
Updated
Jan 14, 2025
•
175
•
13
HumanLLMs/Human-Like-DPO-Dataset
Viewer
•
Updated
13 days ago
•
10.9k
•
512
•
243
virtuoussy/Multi-subject-RLVR
Viewer
•
Updated
Apr 16, 2025
•
579k
•
87
•
67
MiniMaxAI/SynLogic
Viewer
•
Updated
Jul 2, 2025
•
49.3k
•
607
•
100
nvidia/HelpSteer3
Viewer
•
Updated
Nov 16, 2025
•
133k
•
2.47k
•
95
sojuL/RubricHub_v1
Viewer
•
Updated
8 days ago
•
364k
•
581
•
128
Upvote
-
Share collection
View history
Collection guide
Browse collections