EpistemeAI/gpt-oss-deepplan


legolasyiu committed on Sep 20

Commit 3d5e600 (verified) · 1 Parent(s): 3016354

Update README.md

Files changed (1)

README.md +11 -0


@@ -11,6 +11,17 @@ language:
 - en
 ---
+This fine-tuned model is inspired by Nathan Lambert's talk "Traits of Next Generation Reasoning Models".
+It introduces a structured multi-phase reasoning cycle for large language models (LLMs).
+The fine-tuned model goes beyond simple question-answer pairs by adding explicit reasoning phases:
+- **Planning** – The model outlines a step-by-step plan before attempting a solution.
+- **Answering** – The model provides its initial solution.
+- **Double-Checking** – The model revisits its answer, verifying correctness and coherence.
+- **Confidence** – The model assigns a confidence score or justification for its final response.
+This structure encourages models to reason more transparently, self-correct, and calibrate their confidence.
 # Uploaded  model
 - **Developed by:** EpistemeAI
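
The four phases added in this commit (Planning, Answering, Double-Checking, Confidence) could be consumed downstream roughly as sketched below. This is only an illustration under a stated assumption: the README names the phases but does not say how they are delimited in the model's output, so the `Label: text` layout, the `PhasedResponse` type, and the `parse_phases` helper are all hypothetical.

```python
import re
from dataclasses import dataclass

# Phase names come from the README; the "Label: text" layout is an ASSUMPTION,
# since the README does not specify the model's actual output format.
PHASES = ("Planning", "Answering", "Double-Checking", "Confidence")


@dataclass
class PhasedResponse:
    """Hypothetical container for one four-phase model response."""
    planning: str
    answering: str
    double_checking: str
    confidence: str


def parse_phases(text: str) -> PhasedResponse:
    """Split a response into its four phases.

    Assumes each phase starts on its own line as 'Label: ...'.
    Raises ValueError if any phase label is missing.
    """
    labels = "|".join(re.escape(p) for p in PHASES)
    pattern = re.compile(r"^(" + labels + r"):[ \t]*", re.MULTILINE)
    matches = list(pattern.finditer(text))

    sections = {}
    for i, match in enumerate(matches):
        # A section runs until the next label, or to the end of the text.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        sections[match.group(1)] = text[match.end():end].strip()

    missing = [p for p in PHASES if p not in sections]
    if missing:
        raise ValueError(f"missing phases: {missing}")

    return PhasedResponse(
        planning=sections["Planning"],
        answering=sections["Answering"],
        double_checking=sections["Double-Checking"],
        confidence=sections["Confidence"],
    )


# Hand-written sample text, not real model output.
sample = (
    "Planning: Add the tens, then the ones.\n"
    "Answering: 17 + 25 = 42\n"
    "Double-Checking: 42 - 25 = 17, so the sum holds.\n"
    "Confidence: 0.95\n"
)
result = parse_phases(sample)
print(result.answering)
```

Keeping the phases in separate fields makes it straightforward to show only the answer to an end user while logging the plan, the check, and the confidence statement for later inspection.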