NobutaMN/qwen25-7b-sft1-mix2-batch2-kai-maxsteps-1_epo1_1.5e-6 Text Generation • 8B • Updated about 16 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011627 Text Generation • 4B • Updated about 16 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011626 Text Generation • 4B • Updated about 16 hours ago
RinnRinnmini/qwen3-4b-agent-trajectory-merged11-sftdpo_v12 Text Generation • 4B • Updated about 16 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011625 Text Generation • 4B • Updated about 16 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011701 Text Generation • 4B • Updated about 15 hours ago
ryo-llm/qwen3-4b-agent-trajectory-lora-202603011702 Text Generation • 4B • Updated about 15 hours ago