transZ's Collections

Reward model

updated 3 days ago

Reward modelling


  • RLHFlow/SHP-standard

    Viewer • Updated May 9, 2024 • 93.3k • 8

    Note Training


  • transZ/shp

    Viewer • Updated 4 days ago • 10.3k • 14

    Note Test and validation


  • RLHFlow/HH-RLHF-Helpful-standard

    Viewer • Updated Apr 27, 2024 • 115k • 102 • 3

    Note Training


  • transZ/anthropic_helpful_test

    Viewer • Updated 3 days ago • 2.33k • 12

    Note Test


  • RLHFlow/HH-RLHF-Harmless-and-RedTeam-standard

    Viewer • Updated May 8, 2024 • 42.3k • 13 • 4

    Note Training


  • transZ/anthropic_harmless_test

    Viewer • Updated 3 days ago • 2.3k • 12

    Note Test
