FinaPolat
/

llama3_1_8b_thinking_ED

Text Generation

text-generation-inference

Model card Files Files and versions

Uploaded finetuned model

Developed by: FinaPolat
License: apache-2.0
Finetuned from model : FinaPolat/llama3_1_8b_dpo-1k_ED

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month: 148

Safetensors

Model size

8B params

Tensor type

BF16

·

Model tree for FinaPolat/llama3_1_8b_thinking_ED

Base model

FinaPolat/llama3_1_8b_sft-1k_ED

Finetuned

FinaPolat/llama3_1_8b_dpo-1k_ED

Finetuned

(1)

this model

Finetunes

1 model