Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Tandogan
/
MNLP_M2_dpo_model
like
0
Safetensors
Tandogan/sft_dataset_final_train
Tandogan/MNLP_M2_dpo_dataset
qwen3
dpo
unsloth
trl
qwen
instruction-tuning
preference-modeling
mnlp
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
e57bc69
MNLP_M2_dpo_model
1.52 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
Tandogan
initial commit
e57bc69
verified
11 months ago
.gitattributes
Safe
1.52 kB
initial commit
11 months ago