eelixir
/

llama3-8b-thinking-v2

chain-of-thought

Model card Files Files and versions

eelixir commited on Apr 13

Commit

a5de9ed

·

verified ·

1 Parent(s): 7e7aa6a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ This model is a specialized LoRA fine-tune of `Meta-Llama-3-8B-Instruct` designe
 ## 🧠 Model Details
 * **Base Model:** Meta Llama 3 8B Instruct
 * **Fine-Tuning Method:** LoRA (Low-Rank Adaptation) via Unsloth
-* **Dataset:** 476 hand-curated logic, math, and reasoning puzzles.
 * **Epochs:** 3
 * **Primary Goal:** To force "System 2" thinking, reducing hallucinations and impulsive errors on complex prompts.

 ## 🧠 Model Details
 * **Base Model:** Meta Llama 3 8B Instruct
 * **Fine-Tuning Method:** LoRA (Low-Rank Adaptation) via Unsloth
+* **Dataset:** 475 hand-curated logic, math, and reasoning puzzles.
 * **Epochs:** 3
 * **Primary Goal:** To force "System 2" thinking, reducing hallucinations and impulsive errors on complex prompts.