reiwa7/qwen3-4b-agent-lora-lr1e-6_dv4_av5_db19_r128_al256_s525_seed2026_r0 Text Generation • 4B • Updated 1 day ago
melon1891/agentbench-qwen3-4b-2stage-alfw-db-20260301-lr1e6-10ep Text Generation • 4B • Updated 1 day ago • 17
reiwa7/qwen3-4b-agent-lora-lr1e-6_dv4_av5_db19_r128_al256_s535_seed2026_r0 Text Generation • 4B • Updated 1 day ago
melon1891/agentbench-qwen3-4b-2stage-alfw-db-20260301-lr1e6-5ep Text Generation • 4B • Updated 1 day ago • 13
hara-CU/Qwen3-4B-DBbase_AW_345NoEAd_ALFformat_mean1std_1151-r16a32-B16-2ep-5e6 Text Generation • 4B • Updated 1 day ago
RinnRinnmini/qwen3-4b-agent-trajectory-merged11-sftdpo_v11 Text Generation • 4B • Updated 1 day ago • 20