ryo-llm/qwen3-4b-agent-trajectory-lora-202603011855 Text Generation • 4B • Updated about 12 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011854 Text Generation • 4B • Updated about 12 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011856 Text Generation • 4B • Updated about 12 hours ago
kawashimas/qwen3-4b-agent-trajectory-loraDistill_3_1 Text Generation • 4B • Updated about 12 hours ago
mssfj/Qwen2.5-7B-Instruct_alfworld_dbbench_grpo_merge Text Generation • 8B • Updated about 11 hours ago
melon1891/agentbench-qwen3-4b-dbalf-20260301-lr1e6-v2 Text Generation • 4B • Updated about 12 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011923 Text Generation • 4B • Updated about 11 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011924 Text Generation • 4B • Updated about 11 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011925 Text Generation • 4B • Updated about 11 hours ago
NobutaMN/qwen25-7b-sft1-mix2-batch3-kai-maxsteps180_epo1_1.0e-6 Text Generation • 8B • Updated about 11 hours ago
Mountaingorillas/Qwen-2.5-7B-Instruct-Agentbench-lora-db-Strict Text Generation • 8B • Updated about 11 hours ago
NobutaMN/qwen25-7b-sft1-mix2-batch3-kai-maxsteps-1_epo1_1.5e-6 Text Generation • 8B • Updated about 11 hours ago