rokugatsu's picture
Upload DPO-trained Qwen3-4B-Instruct-2507 model
782a5c2 verified