rokugatsu's picture
Upload DPO-trained Qwen3-4B-Instruct-2507 model
d32df1a verified