Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Pinkstackorg
/
PinkQwen2.5-3B-1M-DPO-preview
like
0
Follow
Pinkstack
1
Text Generation
Transformers
PyTorch
Safetensors
English
qwen2
text-generation-inference
unsloth
trl
dpo
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
PinkQwen2.5-3B-1M-DPO-preview
Commit History
Adding `safetensors` variant of this model
b1c1660
verified
Pinkstack
commited on
Apr 19, 2025
Trained with Unsloth
7298720
verified
Pinkstack
commited on
Apr 16, 2025
Upload tokenizer
69e5ef1
verified
Pinkstack
commited on
Apr 16, 2025
Upload README.md with huggingface_hub
67bfef0
verified
Pinkstack
commited on
Apr 16, 2025
initial commit
dcbf3ba
verified
Pinkstack
commited on
Apr 16, 2025