reiwa7/qwen3-4b-agent-lora-lr2e-6-d4-a5-d01-s270-seed2026-drop0 Text Generation • 4B • Updated Feb 25
reiwa7/qwen3-4b-agent-lora-lr2e-6-d4-a5-d005-s270-seed2026-drop0 Text Generation • 4B • Updated Feb 25
kuririrn/qwen25-7b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign_base Text Generation • 8B • Updated Feb 25