Update README.md
README.md
CHANGED
|
@@ -11,6 +11,17 @@ language:
|
|
| 11 |
- en
|
| 12 |
---
|
| 13 |
|
| 14 |
# Uploaded model
|
| 15 |
|
| 16 |
- **Developed by:** EpistemeAI
|
|
|
|
| 11 |
- en
|
| 12 |
---
|
| 13 |
|
| 14 |
+
This fine-tuned model is inspired by Nathan Lambert's talk "Traits of Next Generation Reasoning Models".
|
| 15 |
+
It introduces a structured multi-phase reasoning cycle for large language models (LLMs).
|
| 16 |
+
|
| 17 |
+
The fine-tuned model goes beyond simple question-answer pairs by adding explicit reasoning phases:
|
| 18 |
+
|
| 19 |
+
- **Planning:** The model outlines a step-by-step plan before attempting a solution.
|
| 20 |
+
- **Answering:** The model provides its initial solution.
|
| 21 |
+
- **Double-Checking:** The model revisits its answer, verifying correctness and coherence.
|
| 22 |
+
- **Confidence:** The model assigns a confidence score or justification for its final response.
|
| 23 |
+
This structure encourages models to reason more transparently, self-correct, and calibrate their confidence.
|
| 24 |
+
|
| 25 |
# Uploaded model
|
| 26 |
|
| 27 |
- **Developed by:** EpistemeAI
|
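
The four reasoning phases added in this change (Planning, Answering, Double-Checking, Confidence) can be illustrated with a minimal sketch. The README does not specify how the model formats its output, so the `Name:` section headers, and the helper names `build_prompt` and `parse_phases`, are assumptions made here for illustration only.

```python
import re

# Phase names come from the README; the "Name:" header layout is an assumption.
PHASES = ["Planning", "Answering", "Double-Checking", "Confidence"]


def build_prompt(question: str) -> str:
    """Ask for one labeled section per phase (hypothetical prompt format)."""
    headers = "\n".join(f"{name}: ..." for name in PHASES)
    return (
        f"Question: {question}\n\n"
        "Respond using exactly these sections, in order:\n"
        f"{headers}\n"
    )


def parse_phases(response: str) -> dict:
    """Split a response into {phase: text}; phases that are absent are omitted."""
    pattern = re.compile(
        r"^(" + "|".join(re.escape(p) for p in PHASES) + r"):[ \t]*",
        re.MULTILINE,
    )
    matches = list(pattern.finditer(response))
    sections = {}
    for i, m in enumerate(matches):
        # A section runs until the next phase header or the end of the text.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(response)
        sections[m.group(1)] = response[m.end():end].strip()
    return sections
```

Passing the model's raw text to `parse_phases` returns a dictionary keyed by phase name, which makes it straightforward to check that every phase is present or to read the confidence statement separately from the answer.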