kuririrn/qwen3-4b-agent-trajectory-lora-sft_dpo_v3-a Text Generation • 4B • Updated about 21 hours ago
hara-CU/Qwen3-4B-DBbase_AW_345NoEAd_ALFformat_allstep_1800-r16a32-B16-2ep-5e6 Text Generation • 4B • Updated about 21 hours ago