transZ's Collections
Reward model
Good data
Reward model (updated 3 days ago)
Reward modelling
RLHFlow/SHP-standard • Updated May 9, 2024 • 93.3k • 8
Note: Training
transZ/shp • Updated 4 days ago • 10.3k • 14
Note: Test and validation
RLHFlow/HH-RLHF-Helpful-standard • Updated Apr 27, 2024 • 115k • 102 • 3
Note: Training
transZ/anthropic_helpful_test • Updated 3 days ago • 2.33k • 12
Note: Test
RLHFlow/HH-RLHF-Harmless-and-RedTeam-standard • Updated May 8, 2024 • 42.3k • 13 • 4
Note: Training
transZ/anthropic_harmless_test • Updated 3 days ago • 2.3k • 12
Note: Test