rokugatsu
Upload DPO-trained Qwen3-4B-Instruct-2507 model
c87d0f7 verified