rokugatsu
Upload DPO-trained Qwen3-4B-Instruct-2507 model
853bc77 verified