michlea/HidatoQwenModel
Tags: Transformers, Safetensors, Generated from Trainer, unsloth, trl, grpo, arxiv:2402.03300
File: HidatoQwenModel/training_args.bin (branch: main)
Commit History
00ac0b3  Training in progress, step 150 (verified), michlea committed on Oct 8, 2025
3e9e831  Training in progress, step 150 (verified), michlea committed on Oct 8, 2025
22d70f1  Training in progress, step 500 (verified), michlea committed on Oct 8, 2025
32c051a  Training in progress, step 500 (verified), michlea committed on Oct 7, 2025
3c452ce  Training in progress, step 250 (verified), michlea committed on Oct 7, 2025
4fb26ab  Training in progress, step 250 (verified), michlea committed on Oct 6, 2025
b1e60c2  Training in progress, step 250 (verified), michlea committed on Oct 6, 2025
00a70ca  Training in progress, step 50 (verified), michlea committed on Oct 6, 2025
f9a2df5  Training in progress, step 50 (verified), michlea committed on Oct 5, 2025
520f107  Training in progress, step 50 (verified), michlea committed on Sep 16, 2025
5ea6e60  Training in progress, step 50 (verified), michlea committed on Sep 15, 2025
b3b320a  Training in progress, step 50 (verified), michlea committed on Sep 14, 2025
4b8eb9c  Training in progress, step 50 (verified), michlea committed on Sep 13, 2025
fef8bbb  Training in progress, step 100 (verified), michlea committed on Sep 13, 2025