JERRYPAN617/HH-BTRewardModel-roberta
Tags: Reinforcement Learning, Safetensors, Anthropic/hh-rlhf, English, roberta, safety
License: apache-2.0
Branch: main
Total size: 503 MB
1 contributor
History: 4 commits
Latest commit: JERRYPAN617, "Update README.md" (b65bb1b, verified), 4 months ago
File                     Size          Last commit                          Updated
.gitattributes           1.52 kB       initial commit                       4 months ago
README.md                880 Bytes     Update README.md                     4 months ago
config.json              723 Bytes     Upload folder using huggingface_hub  4 months ago
merges.txt               456 kB        Upload folder using huggingface_hub  4 months ago
model.safetensors        499 MB (xet)  Upload folder using huggingface_hub  4 months ago
special_tokens_map.json  280 Bytes     Upload folder using huggingface_hub  4 months ago
tokenizer.json           3.56 MB       Upload folder using huggingface_hub  4 months ago
tokenizer_config.json    1.25 kB       Upload folder using huggingface_hub  4 months ago
vocab.json               798 kB        Upload folder using huggingface_hub  4 months ago