georgebu/reward_model
Text Classification
Transformers
Safetensors
HumanLLMs/Human-Like-DPO-Dataset
English
llama
trl
reward-trainer
reward_model/README.md (branch: main)
Commit History
d2c46b2 Update README.md (verified), georgebu committed on Mar 28, 2025
3f85e92 Update README.md (verified), georgebu committed on Mar 28, 2025
85fed87 Update README.md (verified), georgebu committed on Mar 28, 2025
bd81d9a Update README.md (verified), georgebu committed on Mar 27, 2025
a9e6c7a Upload LlamaForSequenceClassification (verified), georgebu committed on Mar 21, 2025