JaishreeramCoder/reward_model
Text Classification
Transformers
Safetensors
gemma2
trl
reward-trainer
4-bit precision
bitsandbytes
arxiv:1910.09700
Files and versions
Branch: main
Repository size: 2.36 GB
1 contributor
History: 3 commits
Latest commit: Upload tokenizer (f4064b0, verified), JaishreeramCoder, over 1 year ago
File                     Size       Scan  Storage  Last commit                             Updated
.gitattributes           1.57 kB    Safe  -        Upload tokenizer                        over 1 year ago
README.md                5.19 kB    Safe  -        Upload Gemma2ForSequenceClassification  over 1 year ago
config.json              1.66 kB    Safe  -        Upload Gemma2ForSequenceClassification  over 1 year ago
model.safetensors        2.32 GB    -     xet      Upload Gemma2ForSequenceClassification  over 1 year ago
special_tokens_map.json  522 Bytes  Safe  -        Upload tokenizer                        over 1 year ago
tokenizer.json           34.4 MB    Safe  xet      Upload tokenizer                        over 1 year ago
tokenizer.model          4.24 MB    Safe  xet      Upload tokenizer                        over 1 year ago
tokenizer_config.json    46.4 kB    Safe  -        Upload tokenizer                        over 1 year ago