Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ryzax
/
1.5B-v110
like
0
Follow
ryzax
8
Text Generation
Transformers
Safetensors
qwen3
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
1.5B-v110
Commit History
Training in progress, step 250
3aca492
verified
Muennighoff
commited on
Nov 25, 2025
Training in progress, step 200
931ffe0
verified
Muennighoff
commited on
Nov 24, 2025
Training in progress, step 150
065a0d8
verified
Muennighoff
commited on
Nov 23, 2025
Training in progress, step 100
727860b
verified
Muennighoff
commited on
Nov 23, 2025
Training in progress, step 50
4e9665a
verified
Muennighoff
commited on
Nov 22, 2025
initial commit
e788977
verified
Muennighoff
commited on
Nov 21, 2025