Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
selili688
/
tiny-chatbot-model-dpo
like
0
Text Generation
PEFT
Safetensors
Transformers
dpo
lora
trl
conversational
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Use this model
main
tiny-chatbot-model-dpo
/
README.md
Commit History
DPO adapter
6028948
verified
selili688
commited on
Aug 11, 2025
End of training
e0b927d
verified
selili688
commited on
Aug 11, 2025
Training in progress, step 500
5c9b24c
verified
selili688
commited on
Aug 10, 2025
Training in progress, epoch 0
c292f0a
verified
selili688
commited on
Aug 10, 2025