mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5 Text Generation • 8B • Updated about 7 hours ago
kuririrn/qwen3-4b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign Text Generation • 4B • Updated about 7 hours ago
hiro0904/Qwen3-4B-Instruct-2507-sft-mix2_db360_alf2500_split Text Generation • 4B • Updated about 6 hours ago
kuririrn/qwen3-4b-agent-trajectory_alf_admPlusExtra-lora-constraint_gen-dist_allign Text Generation • 4B • Updated about 6 hours ago
hiro0904/Qwen3-4B-Instruct-2507-sft-mix2_db360_alf500_split Text Generation • 4B • Updated about 6 hours ago
OgawaHiroyuki/qwen3-4b-instruct-lora-sft-advanced-comp-v19 Text Generation • 4B • Updated about 6 hours ago
mssfj/Qwen2.5-7B-Instruct_dbbench_sft_dataset_react_v4 Text Generation • 8B • Updated about 6 hours ago