Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Amshaker
/
Qwen-RL
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
891cf8b
Qwen-RL
181 GB
Ctrl+K
Ctrl+K
1 contributor
History:
12 commits
Amshaker
Upload folder using huggingface_hub
891cf8b
verified
2 months ago
SDPO-train32-alpha0.5-rollout8-lr1e-5-bigmath-Qwen-Qwen3-1.7B
Upload SDPO-train32-alpha0.5-rollout8-lr1e-5-bigmath-Qwen-Qwen3-1.7B/latest_checkpointed_iteration.txt with huggingface_hub
2 months ago
qwen3-1.7b-sft-lora
Upload folder using huggingface_hub
2 months ago
.gitattributes
Safe
2.77 kB
Upload folder using huggingface_hub
2 months ago