Kumeichi/qwen3-4b-agent-trajectory-DataMerge_rev.7_rev.6_lora-SFT-SQL-ALFWorld_rev.0.6 Text Generation • 4B • Updated 1 day ago
hara-CU/Qwen3-4B-LLM2025_DBall_noAgent_AW_345NoEAd_ALF_format1600-r16a32-B16-2ep-5e6 Text Generation • 4B • Updated 1 day ago
melon1891/agentbench-qwen3-4b-2stage-reasoning-20260228 Text Generation • 4B • Updated 1 day ago • 42
RinnRinnmini/qwen3-4b-agent-trajectory-merged11-sftdpo_v10 Text Generation • 4B • Updated 1 day ago • 17