hara-CU/Qwen2.5-7B-Instruct-DB_all_NoError_plus_naturalproofs_Length2.5-8k-SFT-r64a128-B16-1ep-1e4 • Text Generation • 8B • Updated about 16 hours ago
hara-CU/Qwen2.5-7B-Instruct-LLM2025_DBall_NoErrorQ45-SFT-r64a128-B16-3ep-1e4 • Text Generation • 8B • Updated about 15 hours ago
NobutaMN/qwen25-7b-sft1-alfworld-v5-maxsteps140_1.75e-6 • Text Generation • 8B • Updated about 14 hours ago
Kumeichi/qwen3-4b-agent-trajectory-lora-SFT-SQL-ALFWorld_rev.0.6 • Text Generation • 4B • Updated about 14 hours ago
RinnRinnmini/qwen3-4b-agent-trajectory-merged1-sftdpo_v6 • Text Generation • 4B • Updated about 14 hours ago
reiwa7/qwen3-4b-agent-lora-lr2e-6-d4-a5-d01-s270-seed2026-drop0 • Text Generation • 4B • Updated about 13 hours ago
reiwa7/qwen3-4b-agent-lora-lr2e-6-d4-a5-d005-s270-seed2026-drop0 • Text Generation • 4B • Updated about 13 hours ago