MNLP_M3_dpo_model_69k / tokenizer.json

Commit History

Upload fDPO trained Qwen3-0.6B model on MNLP M3 dataset (69k samples)
cc1fd2a
verified

albertfares commited on