Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ricostaedeli
/
Meta-Llama-3.1-8B-Instruct_DPO_1-lora
like
0
Transformers
Safetensors
English
text-generation-inference
unsloth
llama
trl
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Meta-Llama-3.1-8B-Instruct_DPO_1-lora
Commit History
Trained with Unsloth
0493b25
verified
ricostaedeli
commited on
May 29, 2025
Trained with Unsloth
02f531a
verified
ricostaedeli
commited on
May 29, 2025
Trained with Unsloth
1d160bf
verified
ricostaedeli
commited on
May 29, 2025
Trained with Unsloth
cb32641
verified
ricostaedeli
commited on
May 29, 2025
Upload README.md with huggingface_hub
84a5ab1
verified
ricostaedeli
commited on
May 29, 2025
initial commit
33a1554
verified
ricostaedeli
commited on
May 29, 2025