kuririrn/qwen3-4b-agent-trajectory_alfadm_dbweek-lora-constraint_gen-dist_allign_v2 Text Generation • 4B • Updated about 4 hours ago
kuririrn/qwen3-4b-agent-trajectory_alfadm_dbweek-lora-constraint_gen-dist_allign_v3 Text Generation • 4B • Updated about 4 hours ago
NobutaMN/qwen25-7b-sft1-alfworld-v5-maxsteps-1_1.5e-6 Text Generation • 8B • Updated about 4 hours ago
OgawaHiroyuki/qwen3-4b-instruct-lora-sft-advanced-comp-v24 Text Generation • 4B • Updated about 4 hours ago
nak-tak225/qwen3-4b-structured-output-lora-sft-advanced-chappi_v2 Text Generation • 4B • Updated about 1 hour ago
reiwa7/qwen3-4b-agent-trajectory-lora-lr2e-6-alfv5-da14-s270-seed2027-drop0 Text Generation • 4B • Updated about 3 hours ago
OgawaHiroyuki/qwen3-4b-instruct-lora-sft-advanced-comp-v25 Text Generation • 4B • Updated about 3 hours ago
Kumeichi/qwen3-4b-agent-trajectory-lora-SFT-SQL-ALFWorld_rev.0.3 Text Generation • 4B • Updated about 3 hours ago
reiwa7/qwen3-4b-agent-trajectory-lora-lr2e-6-alfv5-da14-s270-seed2028-drop0 Text Generation • 4B • Updated about 3 hours ago
kawashimas/qwen3-4b-agent-trajectory-loraDistillChallenge Text Generation • 4B • Updated about 2 hours ago