Sumiokashi/qwen3-4b-structured-3k-mix-sft_lora-dpo-qwen-cot-merged Text Generation • 4B • Updated 13 days ago • 141