Upload fDPO trained Qwen3-0.6B model on MNLP M3 dataset (69k samples) 4f3038c verified albertfares commited on May 31