AxionLab-Co
/

MiniAxion1-0.9M

Text Generation

Model card Files Files and versions

AxionLab-official commited on Apr 2

Commit

317cb78

·

verified ·

1 Parent(s): 8bed955

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -107,6 +107,7 @@ MiniAxion1 is not intended to compete with large-scale models. Instead, it is bu
 * Susceptible to incorrect intermediate reasoning steps
 * Limited generalization beyond trained patterns
 * Not suitable for production use in critical systems
 ---
@@ -139,12 +140,14 @@ This model provides early evidence that:
 * Scaling parameters (1M → 10M range)
 * Better coupling between reasoning and answers
 * Task-specific specialization (e.g., math-only variants)
 ---
 ## 🤝 Acknowledgments
 This model was developed as part of ongoing experimentation in nano-scale reasoning systems.
 ---

 * Susceptible to incorrect intermediate reasoning steps
 * Limited generalization beyond trained patterns
 * Not suitable for production use in critical systems
+* Due to 920k parameters, low results on evaluation is expected
 ---
 * Scaling parameters (1M → 10M range)
 * Better coupling between reasoning and answers
 * Task-specific specialization (e.g., math-only variants)
+* distillation knowledge on bigger models
 ---
 ## 🤝 Acknowledgments
 This model was developed as part of ongoing experimentation in nano-scale reasoning systems.
+the main question was: "How low could a model think(or mimic it)?
 ---