Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
guoxz 's Collections
game
kernel
multimodal_math
image
multimodal-reasoning
security
eval
o1_like
cot
agent
voice
rl
instruction_with_rationale
instruction
math
code
pretrain
Multimodality
med
func_call
role
law

rl

updated 6 days ago
Upvote
-

  • Skywork/Skywork-Reward-Preference-80K-v0.1

    Viewer • Updated Oct 25, 2024 • 82k • 87 • 45

  • mlabonne/open-perfectblend

    Viewer • Updated Jan 15, 2025 • 1.42M • 424 • 62

  • allenai/llama-3.1-tulu-3-70b-preference-mixture

    Viewer • Updated Feb 4, 2025 • 337k • 146 • 19

  • OpenLeecher/lmsys_chat_1m_clean

    Viewer • Updated Dec 31, 2024 • 273k • 201 • 82

  • opencsg/UltraFeedback-chinese

    Preview • Updated Jan 14, 2025 • 175 • 13

  • HumanLLMs/Human-Like-DPO-Dataset

    Viewer • Updated 13 days ago • 10.9k • 512 • 243

  • virtuoussy/Multi-subject-RLVR

    Viewer • Updated Apr 16, 2025 • 579k • 87 • 67

  • MiniMaxAI/SynLogic

    Viewer • Updated Jul 2, 2025 • 49.3k • 607 • 100

  • nvidia/HelpSteer3

    Viewer • Updated Nov 16, 2025 • 133k • 2.47k • 95

  • sojuL/RubricHub_v1

    Viewer • Updated 8 days ago • 364k • 581 • 128
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs