Hugging Face
little-jack's Collections
Cite
agent
planning
IFT
RLHF
sft
pre-train
some benchmark
RLHF
Updated Aug 5, 2025
Anthropic/hh-rlhf
Updated May 26, 2023 • 169k • 19.5k • 1.67k