kuririrn/qwen3-4b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign_c Text Generation • 4B • Updated about 16 hours ago
kawashimas/qwen3-4b-agent-trajectory-loraDistillAgain Text Generation • 4B • Updated about 16 hours ago
reiwa7/qwen3-4b-agent-trajectory-lora-lr1-5e-6-alfv5-da14-s270-seed2026-drop0 Text Generation • 4B • Updated about 16 hours ago
hiro0904/Qwen3-4B-Instruct-2507-sft-alfv5-no-cotmarker-lr3em5 Text Generation • 4B • Updated about 16 hours ago
hiro0904/Qwen3-4B-Instruct-2507-sft-alfv5-no-cotmarker-lr5em5 Text Generation • 4B • Updated about 16 hours ago
reiwa7/qwen3-4b-agent-trajectory-lora-lr1-2e-6-alfv5-da14-s270-seed2026-drop0 Text Generation • 4B • Updated about 15 hours ago
NobutaMN/qwen25-7b-sft1-alfworld-v5-maxsteps210_1.5e-6 Text Generation • 8B • Updated about 15 hours ago