satoyutaka/LLM_main_LGSFT_150_DPO2
Tags: Text Generation, PEFT, Safetensors, Transformers, dpo, lora, trl, arxiv:1910.09700
main
LLM_main_LGSFT_150_DPO2/README.md
Commit History
Upload 3 files (f417357, verified), satoyutaka committed 4 days ago
initial commit (855668b, verified), satoyutaka committed 4 days ago