Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
izzcw
/
dpo_crafting_lora_from_sft
like
0
PEFT
TensorBoard
Safetensors
llama-factory
lora
trl
dpo
Generated from Trainer
License:
llama3
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Use this model
main
dpo_crafting_lora_from_sft
Commit History
End of training
9b56c9a
verified
izzcw
commited on
May 1, 2025
Model save
d5eadb7
verified
izzcw
commited on
May 1, 2025
Training in progress, step 1500
49fc843
verified
izzcw
commited on
May 1, 2025
Training in progress, step 1000
6641905
verified
izzcw
commited on
May 1, 2025
Training in progress, step 500
62d00a9
verified
izzcw
commited on
May 1, 2025
initial commit
6c7ff75
verified
izzcw
commited on
Apr 30, 2025