Klingspor/StarPO-4B
Tags: Text Generation, Safetensors, English, qwen3, 20-questions, rl, grpo, starpo, multi-turn, information-seeking, reinforcement-learning, credit-assignment, conversational
License: apache-2.0
Commit History (branch: main)
d56f422  Update README.md (Klingspor, verified, committed 12 days ago)
c93dbee  Create README.md (Klingspor, verified, committed 12 days ago)
504bf81  Upload folder using huggingface_hub (Klingspor, verified, committed Jan 14)
ff17c3c  initial commit (Klingspor, verified, committed Jan 14)