ryo-llm/qwen3-4b-agent-trajectory-lora-202602252246 Text Generation • 4B • Updated about 10 hours ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-11 Text Generation • 8B • Updated about 9 hours ago
reiwa7/qwen3-4b-agent-trajectory-lora-lr1-2e-6-alfv5-da14-s270-seed2026f-drop0 Text Generation • 4B • Updated about 7 hours ago
hiro0904/Qwen3-4B-Instruct-2507-sft-alfv5-no-cotmarker-lr2em5-sft-dbb-rank-agg-lr5em6 Text Generation • 4B • Updated about 9 hours ago
NobutaMN/qwen25-7b-sft1-dbbench-v2-maxsteps-1_1.5e-6 Text Generation • 8B • Updated about 8 hours ago
hiro0904/Qwen3-4B-Instruct-2507-sft-alfv5-no-cotmarker-lr2em5-sft-dbb-rank-agg-lr1em5 Text Generation • 4B • Updated about 8 hours ago