Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Amshaker
/
Qwen-RL
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
Qwen-RL
340 GB
Ctrl+K
Ctrl+K
1 contributor
History:
19 commits
Amshaker
Upload folder using huggingface_hub
d7dde68
verified
about 1 month ago
GRPO_1024_global_step_2680
Upload folder using huggingface_hub
about 2 months ago
GRPO_1024_global_step_6000
Upload folder using huggingface_hub
about 1 month ago
GRPO_2048_global_step_2800
Upload folder using huggingface_hub
about 2 months ago
GRPO_2048_global_step_700
Upload folder using huggingface_hub
about 2 months ago
Polaris-Reproduce-1.7B-1-node
Upload folder using huggingface_hub
about 1 month ago
SDPO-train32-alpha0.5-rollout8-lr1e-5-bigmath-Qwen-Qwen3-1.7B
Upload SDPO-train32-alpha0.5-rollout8-lr1e-5-bigmath-Qwen-Qwen3-1.7B/latest_checkpointed_iteration.txt with huggingface_hub
about 2 months ago
grpo_2048_thinking_step_6000
Upload folder using huggingface_hub
about 1 month ago
qwen3-1.7b-sft-lora
Upload folder using huggingface_hub
about 2 months ago
qwen3-1.7b-sft
Upload folder using huggingface_hub
about 2 months ago
.gitattributes
3.54 kB
Upload folder using huggingface_hub
about 1 month ago