Kumeichi/qwen3-4b-agent-trajectory-DataMerge_rev.7_rev.6_lora-SFT-SQL-ALFWorld_rev.0.6 Text Generation • 4B • Updated 7 days ago
Kumeichi/qwen3-4b-agent-trajectory-DataMerge_rev.6_rev.6_lora-SFT-SQL-ALFWorld_rev.0.6 Text Generation • 4B • Updated 7 days ago
Kumeichi/qwen3-4b-agent-trajectory-DataMerge_rev.5_rev.6_lora-SFT-SQL-ALFWorld_rev.0.6 Text Generation • 4B • Updated 7 days ago
Kumeichi/qwen3-4b-agent-trajectory-DataMerge_rev.4_rev.6_lora-SFT-SQL-ALFWorld_rev.0.6 Text Generation • 4B • Updated 8 days ago
Kumeichi/qwen3-4b-agent-Kumeichi-Data-lora-SFT-SQL-ALFWorld_rev.0.3 Text Generation • 4B • Updated 8 days ago
Kumeichi/qwen3-4b-agent-Kumeichi-Data-lora-SFT-SQL-ALFWorld_rev.0.2 Text Generation • 4B • Updated 9 days ago
Kumeichi/qwen3-4b-agent-trajectory-lora-SFT-SQL-ALFWorld_rev.0.6 Text Generation • 4B • Updated 10 days ago
Kumeichi/qwen3-4b-agent-trajectory-lora-SFT-SQL-ALFWorld_rev.0.5 Text Generation • 4B • Updated 10 days ago
Kumeichi/qwen3-4b-agent-trajectory-lora-SFT-SQL-ALFWorld_rev.0.4 Text Generation • 4B • Updated 10 days ago
Kumeichi/qwen3-4b-agent-trajectory-lora-SFT-SQL-ALFWorld_rev.0.3 Text Generation • 4B • Updated 11 days ago
Kumeichi/qwen3-4b-agent-lora-SFT-SQL-ALFWorld_rev.Kume0.2 Text Generation • 4B • Updated 15 days ago • 74
Kumeichi/qwen3-4b-agent-trajectory-lora-SFT-SQL-ALFWorld_rev.0.2 Text Generation • 4B • Updated 19 days ago
Kumeichi/qwen3-4b-agent-trajectory-lora-SFT-SQL-ALFWorld_rev.0 Text Generation • 4B • Updated 19 days ago
Kumeichi/dpo-rev.0.1-qwen-cot-merged-with-qwen3-4b-SFT-rev.0.8 Text Generation • 4B • Updated 28 days ago • 32