Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
eZWALT
/
SmolLM2-135M-Pedantic-Reward-Model
like
0
Text Classification
Transformers
Safetensors
llama
Generated from Trainer
reward-trainer
trl
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
SmolLM2-135M-Pedantic-Reward-Model
Commit History
Upload LlamaForSequenceClassification
5a113e6
verified
eZWALT
commited on
Oct 26, 2025
Update README.md
ed8cd39
verified
eZWALT
commited on
Oct 24, 2025
Upload LlamaForSequenceClassification
5848245
verified
eZWALT
commited on
Oct 24, 2025
initial commit
4e08b33
verified
eZWALT
commited on
Oct 24, 2025