Hugging Face
Klingspor/StarPO-4B
Text Generation
Safetensors
English
qwen3
20-questions
rl
grpo
starpo
multi-turn
information-seeking
reinforcement-learning
credit-assignment
conversational
License: apache-2.0
main
StarPO-4B/tokenizer.json
Commit History
Upload folder using huggingface_hub
504bf81
verified
Klingspor committed on Jan 14