Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Amshaker
/

Qwen-RL

Model card Files Files and versions

181 GB

Ctrl+K

Ctrl+K

1 contributor

History: 12 commits

Amshaker's picture

Upload folder using huggingface_hub

891cf8b verified 2 months ago

SDPO-train32-alpha0.5-rollout8-lr1e-5-bigmath-Qwen-Qwen3-1.7B
Upload SDPO-train32-alpha0.5-rollout8-lr1e-5-bigmath-Qwen-Qwen3-1.7B/latest_checkpointed_iteration.txt with huggingface_hub 2 months ago
qwen3-1.7b-sft-lora
Upload folder using huggingface_hub 2 months ago
.gitattributes

2.77 kB
Upload folder using huggingface_hub 2 months ago