Veiterr/MNLP_M2_dpo_model_unsloth
Text Generation
Transformers
Safetensors
qwen3
unsloth
trl
dpo
conversational
text-generation-inference
arxiv:1910.09700
main
Commit History
Upload model trained with Unsloth
2789e9e
verified
Veiterr
committed on
Jun 5, 2025
Trained with Unsloth
dd29504
verified
Veiterr
committed on
Jun 5, 2025
Upload tokenizer
0e17687
verified
Veiterr
committed on
Jun 5, 2025
Upload model trained with Unsloth
aae5448
verified
Veiterr
committed on
Jun 5, 2025
Trained with Unsloth
0e029aa
verified
Veiterr
committed on
Jun 5, 2025
Upload tokenizer
6e12a5c
verified
Veiterr
committed on
Jun 5, 2025
Upload model trained with Unsloth
7933c15
verified
Veiterr
committed on
Jun 5, 2025
Trained with Unsloth
93305bf
verified
Veiterr
committed on
Jun 5, 2025
Upload tokenizer
ae55032
verified
Veiterr
committed on
Jun 5, 2025
Upload model trained with Unsloth
1dc4609
verified
Veiterr
committed on
Jun 5, 2025
Trained with Unsloth
008274b
verified
Veiterr
committed on
Jun 5, 2025
Upload tokenizer
18e8f06
verified
Veiterr
committed on
Jun 5, 2025
initial commit
94ba163
verified
Veiterr
committed on
Jun 5, 2025