Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ tags:
22
 
23
  # Axion1-350K-A250K
24
 
25
- > **DeepSeek-V3 architecture scaled to ~344k total parameters (~160k active/token) — runs entirely on CPU.**
26
 
27
  Built from scratch as a proof-of-concept that the real DeepSeek-V3 architectural innovations
28
  (MLA + DeepSeekMoE + auxiliary-loss-free load balancing) work correctly even at extreme miniaturization.
 
22
 
23
  # Axion1-350K-A250K
24
 
25
+ > **DeepSeek-V3 architecture scaled to \~344k total parameters (\~160k active/token) — runs entirely on CPU.**
26
 
27
  Built from scratch as a proof-of-concept that the real DeepSeek-V3 architectural innovations
28
  (MLA + DeepSeekMoE + auxiliary-loss-free load balancing) work correctly even at extreme miniaturization.