Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
eZWALT 's Collections
Multimodal NanoChimera
Pretraining Corpora
RLHF Resources
Cursed Toxic Pretraining Corpora

RLHF Resources

updated Oct 21, 2025
Upvote
-

  • HuggingFaceTB/SmolLM2-135M-Instruct

    Text Generation • 0.1B • Updated Sep 22, 2025 • 755k • 301

  • Anthropic/hh-rlhf

    Viewer • Updated May 26, 2023 • 169k • 20.2k • 1.67k

  • allenai/ultrafeedback_binarized_cleaned_train

    Viewer • Updated Aug 28, 2024 • 61.8k • 31 • 1

  • arnir0/Tiny-LLM

    Text Generation • 13M • Updated Nov 3, 2024 • 40.5k • 46

  • trl-lib/hh-rlhf-helpful-base

    Viewer • Updated Jan 8, 2025 • 46.2k • 339 • 3

  • yitingxie/rlhf-reward-datasets

    Viewer • Updated Jan 1, 2023 • 81.4k • 91 • 65
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs