DLNorb/dpo_base_model_stage2
Text Generation
PEFT
Safetensors
u-10bei/dpo-dataset-qwen-cot
English
qlora
lora
dpo
structured-output
License: apache-2.0
main
Commit History
Upload LoRA adapter (README written by author)
95362f5
verified
DLNorb committed 3 days ago
initial commit
e430b46
verified
DLNorb committed 3 days ago