ryo-llm/qwen3-4b-agent-trajectory-lora-202603011510 Text Generation • 4B • Updated about 10 hours ago
morizon/qwen2.5-7b-agent-trajectory-lora_0301_run_13 Text Generation • 8B • Updated about 10 hours ago
kuririrn/qwen3-4b-agent-trajectory-lora-sft_multi_dpo Text Generation • 4B • Updated about 10 hours ago
hara-CU/Qwen3-4B-DBbase_AW_345NoEAd_ALFformat_mincut_1609-r16a32-B16-2ep-5e6 Text Generation • 4B • Updated about 9 hours ago
kuririrn/qwen3-4b-agent-trajectory-lora-sft_multi_dpo_merged Text Generation • 4B • Updated about 9 hours ago
hiro0904/agentbench2026-Qwen3-4B-Instruct-2507-sft-alfv5-no-cotmarker-lr1p5em5 Text Generation • 4B • Updated about 3 hours ago
NobutaMN/qwen25-7b-sft1-mix2-batch2-kai-maxsteps-1_epo1_1.5e-6 Text Generation • 8B • Updated about 9 hours ago