ryo-llm/qwen3-4b-agent-trajectory-lora-202603012323 Text Generation • 4B • Updated about 10 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603012322 Text Generation • 4B • Updated about 10 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603012324 Text Generation • 4B • Updated about 10 hours ago
yuyuchily/agentbench-qwen25-7b-hybrid-alf-react-max2 Text Generation • 8B • Updated about 9 hours ago
hara-CU/Qwen3-4B-DBbase_AW_345NoEAd_ALFformat_QH5L4R5_1392-r16a32-B16-3ep-5e6 Text Generation • 4B • Updated about 9 hours ago
mssfj/Qwen2.5-7B-Instruct_alfworld_dbbench_grpo_merge-2 Text Generation • 8B • Updated about 9 hours ago
NobutaMN/qwen25-7b-sft1-mix2-batch3-one-maxsteps200_epo1_1.0e-6 Text Generation • 8B • Updated about 9 hours ago
reiwa7/qwen3-4b-agent-lora-lr1e-6_dv4_av5_db19_r128_al256_s530_seed42_r0 Text Generation • 4B • Updated about 9 hours ago
reiwa7/qwen3-4b-agent-lora-lr1e-6_dv4_av5_db19_r128_al256_s530_seed777_r0 Text Generation • 4B • Updated about 9 hours ago
RinnRinnmini/qwen3-4b-agent-trajectory-merged11-sftdpo_v14 Text Generation • 4B • Updated about 9 hours ago
hara-CU/Qwen3-4B-LLM2025_DB_all_NoError_plus_AWv345NoEAd_ALF_format1600-r16a32-B16-3ep-5e6 Text Generation • 4B • Updated about 9 hours ago