albertfares
/

MNLP_M3_dpo_model_69k

Model card Files Files and versions

albertfares commited on May 30, 2025

Commit

4b9a0c1

·

verified ·

1 Parent(s): cc1fd2a

Upload fDPO‑trained Qwen3‑0.6B (100k samples) — no local weight load

Files changed (1) hide show

README.md +8 -0

README.md ADDED Viewed

	@@ -0,0 +1,8 @@

+---
+license: apache-2.0
+base_model: Qwen/Qwen3-0.6B-Base
+tags: [fdpo, mnlp, math, code, qwen3]
+---
+# MNLP M3 fDPO model
+Uploaded 2025-05-30. Full training details in repo history.