AxionLab-official commited on
Commit
8bed955
ยท
verified ยท
1 Parent(s): f550502

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +159 -3
README.md CHANGED
@@ -1,3 +1,159 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ metrics:
6
+
7
+ pipeline_tag: text-generation
8
+ tags:
9
+ - nrm
10
+ - nano
11
+ - reasoning
12
+ - thinking
13
+ - sub-1m
14
+ - lowparams
15
+ - custom_code
16
+ ---
17
+
18
+ # ๐Ÿง  MiniAxion1-0.9M
19
+
20
+ **MiniAxion1-0.9M** is a Nano Reasoning Model (NRM) with ~920K parameters designed to explore the emergence of structured reasoning in extremely small neural networks.
21
+
22
+ Despite its minimal size, the model demonstrates strong consistency in reasoning format and step-based thinking using explicit `<THINK>` and `<STEP>` tokens.
23
+
24
+ ---
25
+
26
+ ## ๐Ÿš€ Overview
27
+
28
+ * **Model Type:** Nano Reasoning Model (NRM)
29
+ * **Parameters:** ~920,833
30
+ * **Architecture:** Transformer (6 layers: 2 entry + 2 shared + 2 exit)
31
+ * **d_model:** 256
32
+ * **Heads:** 8
33
+ * **FFN size:** 512
34
+ * **LoRA Rank:** 16
35
+ * **Vocabulary Size:** 2048
36
+ * **Training Time:** ~80 minutes (CPU)
37
+
38
+ ---
39
+
40
+ ## ๐Ÿง  Key Capabilities
41
+
42
+ ### โœ… Structured Reasoning
43
+
44
+ The model reliably produces structured reasoning traces:
45
+
46
+ ```
47
+ <THINK>
48
+ <STEP> ...
49
+ <STEP> ...
50
+ </THINK>
51
+ <ANS>...</ANS>
52
+ ```
53
+
54
+ * 100% usage of reasoning tokens
55
+ * Consistent multi-step formatting
56
+ * Stable output structure across tasks
57
+
58
+ ---
59
+
60
+ ### โšก Ultra-Lightweight
61
+
62
+ * Runs efficiently on CPU
63
+ * Designed for experimentation and rapid iteration
64
+ * Suitable for embedded or game-like environments
65
+
66
+ ---
67
+
68
+ ### ๐Ÿงช Research-Oriented Design
69
+
70
+ MiniAxion1 is not intended to compete with large-scale models. Instead, it is built to:
71
+
72
+ * Study reasoning emergence in small models
73
+ * Explore structure vs correctness trade-offs
74
+ * Enable fast iteration cycles for AI research
75
+
76
+ ---
77
+
78
+ ## ๐Ÿ“Š Evaluation Results
79
+
80
+ | Task | Accuracy |
81
+ | ----------------------- | -------- |
82
+ | Arithmetic | 3.3% |
83
+ | Two-Step Arithmetic | 10.0% |
84
+ | Even/Odd | 100.0% |
85
+ | Comparison | 5.0% |
86
+ | Pattern Completion | 0.0% |
87
+ | Word Problems | 0.0% |
88
+ | Sorting | 0.0% |
89
+ | Chain-of-Thought Format | 100.0% |
90
+
91
+ **Average Accuracy:** 16.9%
92
+
93
+ ---
94
+
95
+ ## ๐Ÿ” Observations
96
+
97
+ * The model learns reasoning *structure* before reasoning *correctness*
98
+ * Chain-of-thought formatting is highly reliable
99
+ * Arithmetic and symbolic reasoning remain limited at this scale
100
+ * Evidence of partial decoupling between reasoning steps and final answers
101
+
102
+ ---
103
+
104
+ ## โš ๏ธ Limitations
105
+
106
+ * Weak performance on arithmetic and multi-step reasoning tasks
107
+ * Susceptible to incorrect intermediate reasoning steps
108
+ * Limited generalization beyond trained patterns
109
+ * Not suitable for production use in critical systems
110
+
111
+ ---
112
+
113
+ ## ๐ŸŽฏ Intended Use Cases
114
+
115
+ * ๐Ÿงช AI research and experimentation
116
+ * ๐ŸŽฎ Game AI / NPC reasoning simulation
117
+ * ๐Ÿ“š Educational demonstrations of reasoning structure
118
+ * โš™๏ธ Lightweight reasoning prototypes
119
+
120
+ ---
121
+
122
+ ## ๐Ÿง  Philosophy
123
+
124
+ MiniAxion1 explores a key question:
125
+
126
+ > *Can structured reasoning emerge in extremely small models?*
127
+
128
+ This model provides early evidence that:
129
+
130
+ * Reasoning format can be learned efficiently
131
+ * Structure and correctness are separable capabilities
132
+ * Useful behavior can emerge even at sub-1M scale
133
+
134
+ ---
135
+
136
+ ## ๐Ÿ”ฎ Future Directions
137
+
138
+ * Improved dataset alignment for arithmetic reasoning
139
+ * Scaling parameters (1M โ†’ 10M range)
140
+ * Better coupling between reasoning and answers
141
+ * Task-specific specialization (e.g., math-only variants)
142
+
143
+ ---
144
+
145
+ ## ๐Ÿค Acknowledgments
146
+
147
+ This model was developed as part of ongoing experimentation in nano-scale reasoning systems.
148
+
149
+ ---
150
+
151
+ ## ๐Ÿ“Ž Model
152
+
153
+ ๐Ÿ‘‰ https://huggingface.co/AxionLab-Co/MiniAxion1-0.9M
154
+
155
+ ---
156
+
157
+ ## ๐Ÿงช Disclaimer
158
+
159
+ This is an experimental research model. Outputs may be incorrect even when reasoning appears structured or convincing.