Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Pinkstackorg
/
PinkQwen2.5-3B-1M-DPO-preview
like
0
Follow
Pinkstack
1
Text Generation
Transformers
PyTorch
Safetensors
English
qwen2
text-generation-inference
unsloth
trl
dpo
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
PinkQwen2.5-3B-1M-DPO-preview
Commit History
Update README.md
ec99b1a
verified
Pinkstack
commited on
Apr 23
Adding `safetensors` variant of this model (
#1
)
8081def
verified
Pinkstack
commited on
Apr 23
Update README.md
8d19c5a
verified
Pinkstack
commited on
Apr 23
Trained with Unsloth
7298720
verified
Pinkstack
commited on
Apr 16
Upload tokenizer
69e5ef1
verified
Pinkstack
commited on
Apr 16
Upload README.md with huggingface_hub
67bfef0
verified
Pinkstack
commited on
Apr 16
initial commit
dcbf3ba
verified
Pinkstack
commited on
Apr 16