albertfares/MNLP_M3_dpo_model_69k
Tags: PyTorch, Safetensors, qwen3, fdpo, mnlp, math, code
License: apache-2.0
albertfares committed on May 30, 2025
Commit 4b9a0c1 (verified) · 1 parent: cc1fd2a
Upload fDPO‑trained Qwen3‑0.6B (100k samples) — no local weight load
Files changed (1): README.md (added, +8 -0)
@@ -0,0 +1,8 @@
+---
+license: apache-2.0
+base_model: Qwen/Qwen3-0.6B-Base
+tags: [fdpo, mnlp, math, code, qwen3]
+---
+# MNLP M3 fDPO model
+
+Uploaded 2025-05-30. Full training details in repo history.
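
The front matter added in this diff is plain key/value metadata. As a rough illustration (not part of the commit), here is a minimal Python sketch that reads it using only the standard library. The `parse_front_matter` helper and the `README` string are hypothetical names introduced for this example, and the parser assumes flat `key: value` pairs and inline `[a, b]` lists only; a full YAML library would be needed for anything richer.

```python
# Hypothetical helper, not part of the commit: reads the README front matter
# shown in the diff. Assumes flat `key: value` pairs and inline `[a, b]` lists.

README = """\
---
license: apache-2.0
base_model: Qwen/Qwen3-0.6B-Base
tags: [fdpo, mnlp, math, code, qwen3]
---
# MNLP M3 fDPO model

Uploaded 2025-05-30. Full training details in repo history.
"""


def parse_front_matter(text):
    """Return (metadata dict, body string) for text with a leading --- block."""
    lines = text.splitlines()
    stripped = [line.strip() for line in lines]
    if not stripped or stripped[0] != "---":
        return {}, text  # no front matter present
    try:
        end = stripped.index("---", 1)  # position of the closing delimiter
    except ValueError:
        return {}, text  # unterminated block: treat everything as body
    meta = {}
    for line in lines[1:end]:
        if ":" not in line:
            continue  # skip blank or malformed lines
        key, _, value = line.partition(":")
        value = value.strip()
        if value.startswith("[") and value.endswith("]"):
            # Inline list: split on commas and drop empty items.
            value = [item.strip() for item in value[1:-1].split(",") if item.strip()]
        meta[key.strip()] = value
    body = "\n".join(lines[end + 1:])
    return meta, body


meta, body = parse_front_matter(README)
print(meta["base_model"])
print(meta["tags"])
```

The `base_model` and `tags` values read this way match the fields the diff adds; the heading and upload note are returned separately as the body.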