Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
georgebu
/
reward_model
like
0
Text Classification
Transformers
Safetensors
HumanLLMs/Human-Like-DPO-Dataset
English
llama
trl
reward-trainer
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
reward_model
Commit History
Update README.md
d2c46b2
verified
georgebu
commited on
Mar 28, 2025
Update README.md
3f85e92
verified
georgebu
commited on
Mar 28, 2025
Update README.md
85fed87
verified
georgebu
commited on
Mar 28, 2025
Update README.md
bd81d9a
verified
georgebu
commited on
Mar 27, 2025
Upload LlamaForSequenceClassification
f20b8f5
verified
georgebu
commited on
Mar 21, 2025
Upload LlamaForSequenceClassification
a9e6c7a
verified
georgebu
commited on
Mar 21, 2025
initial commit
9daa9cf
verified
georgebu
commited on
Mar 21, 2025