qqliangqi's Collections
Cite
agent
planning
IFT
RLHF
sft
pre-train
some benchmark
RLHF
Updated Aug 5, 2025
Anthropic/hh-rlhf • Updated May 26, 2023 • 169k • 21.3k • 1.65k