rokugatsu's picture
Upload DPO-trained Qwen3-4B-Instruct-2507 model
11af1fb verified