MNLP_M3_dpo_model / tokenizer.json

Commit History

Upload fDPO trained Qwen3-0.6B model on MNLP M3 dataset (69k samples)
4f3038c
verified

albertfares commited on