Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ryzax
/
1.5B-v70
like
0
Follow
ryzax
8
Text Generation
Transformers
Safetensors
qwen3
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
1.5B-v70
Commit History
Training in progress, step 200
7540e91
verified
Muennighoff
commited on
Sep 12, 2025
Training in progress, step 200
d6a9600
verified
Muennighoff
commited on
Sep 12, 2025
Training in progress, step 100
5fb2ac9
verified
Muennighoff
commited on
Sep 12, 2025
Training in progress, step 100
ec25c73
verified
Muennighoff
commited on
Sep 12, 2025
initial commit
b66a5cb
verified
Muennighoff
commited on
Sep 11, 2025