Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Jlonge4
/
phi4-mini-judge-r
like
0
Transformers
Safetensors
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
phi4-mini-judge-r
Commit History
End of training
5fff7f8
verified
Jlonge4
commited on
Oct 16, 2025
End of training
8f7565e
verified
Jlonge4
commited on
Oct 3, 2025
initial commit
481b470
verified
Jlonge4
commited on
Oct 3, 2025