Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
xx18
/
DirectRL_Qwen3-4B_baseline2
like
0
Text Generation
Safetensors
POLARIS-Project/Polaris-Dataset-53K
English
qwen3
reasoning
conversational
Model card
Files
Files and versions
xet
Community
main
DirectRL_Qwen3-4B_baseline2
/
tokenizer.json
Commit History
Upload folder using huggingface_hub
c8607f4
verified
xx18
commited on
Nov 7, 2025