albertfares
/

MNLP_M3_dpo_model

Text Generation

Model card Files Files and versions

albertfares commited on Jun 2, 2025

Commit

f85d307

·

verified ·

1 Parent(s): 08b0cb8

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ language:
 pipeline_tag: text-generation
 ---
-# MNLP M3 fDPO Model (69k samples)
 This model is a fine-tuned version of [Qwen/Qwen3-0.6B-Base](https://huggingface.co/Qwen/Qwen3-0.6B-Base) using **filtered Direct Preference Optimization (fDPO)** on the [MNLP M3 DPO dataset](https://huggingface.co/datasets/albertfares/MNLP_M3_dpo_dataset).

 pipeline_tag: text-generation
 ---
+# MNLP M3 fDPO Model (187k samples)
 This model is a fine-tuned version of [Qwen/Qwen3-0.6B-Base](https://huggingface.co/Qwen/Qwen3-0.6B-Base) using **filtered Direct Preference Optimization (fDPO)** on the [MNLP M3 DPO dataset](https://huggingface.co/datasets/albertfares/MNLP_M3_dpo_dataset).