MNLP_M3_dpo_model / pytorch_model.bin

Commit History

Upload fDPO trained Qwen3-0.6B model on MNLP M3 dataset (69k samples)
4f3038c
verified

albertfares commited on