Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Amshaker
/
Qwen-RL
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
Qwen-RL
/
grpo_2048_thinking_step_6000
21.9 GB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
Amshaker
Upload folder using huggingface_hub
52ce062
verified
about 2 months ago
actor
Upload folder using huggingface_hub
about 2 months ago
data.pt
7.32 kB
xet
Upload folder using huggingface_hub
about 2 months ago