reiwa7/qwen3-4b-agent-trajectory-lora-lr2e-6-dbv4-alfv5-da14-s300 Text Generation • 4B • Updated about 5 hours ago
RinnRinnmini/qwen3-4b-agent-trajectory-merged-sftdpo_v5 Text Generation • 4B • Updated about 5 hours ago
reiwa7/qwen3-4b-agent-trajectory-lora-lr2e-6-dbv4-alfv5-da14-s360 Text Generation • 4B • Updated about 4 hours ago
reiwa7/qwen3-4b-agent-trajectory-lora-lr2e-6-dbv4-alfv5-da14-s-1 Text Generation • 4B • Updated about 2 hours ago