honggen/hard_dpo
Text Generation
Anthropic/hh-rlhf
English
License: apache-2.0
Branch: main
Total size: 11.2 GB
1 contributor
History: 3 commits
Latest commit: honggen, "Create README.md" (bd014a4, verified), almost 2 years ago
File            Size             Last commit        Updated
.gitattributes  1.52 kB          initial commit     almost 2 years ago
README.md       176 Bytes        Create README.md   almost 2 years ago
policy.pt       11.2 GB (xet)    Upload policy.pt   almost 2 years ago