eelixir commited on
Commit
a5de9ed
·
verified ·
1 Parent(s): 7e7aa6a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -18,7 +18,7 @@ This model is a specialized LoRA fine-tune of `Meta-Llama-3-8B-Instruct` designe
18
  ## 🧠 Model Details
19
  * **Base Model:** Meta Llama 3 8B Instruct
20
  * **Fine-Tuning Method:** LoRA (Low-Rank Adaptation) via Unsloth
21
- * **Dataset:** 476 hand-curated logic, math, and reasoning puzzles.
22
  * **Epochs:** 3
23
  * **Primary Goal:** To force "System 2" thinking, reducing hallucinations and impulsive errors on complex prompts.
24
 
 
18
  ## 🧠 Model Details
19
  * **Base Model:** Meta Llama 3 8B Instruct
20
  * **Fine-Tuning Method:** LoRA (Low-Rank Adaptation) via Unsloth
21
+ * **Dataset:** 475 hand-curated logic, math, and reasoning puzzles.
22
  * **Epochs:** 3
23
  * **Primary Goal:** To force "System 2" thinking, reducing hallucinations and impulsive errors on complex prompts.
24