Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mingye94
/
rm_llama3_8B_helpsteer2
like
0
Safetensors
llama
trl
reward-trainer
Generated from Trainer
License:
llama3
Model card
Files
Files and versions
xet
Community
main
rm_llama3_8B_helpsteer2
/
tokenizer.json
Commit History
End of training
2dbb74b
verified
mingye94
commited on
Nov 4, 2024
End of training
f7888a9
verified
mingye94
commited on
Nov 4, 2024
End of training
8e1465d
verified
mingye94
commited on
Nov 3, 2024
End of training
114b36d
verified
mingye94
commited on
Nov 3, 2024
End of training
07617d8
verified
mingye94
commited on
Oct 30, 2024