File size: 8,022 Bytes
9a41b58
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
# mAIndlock β€” the engine

> Every NPC is not a chatbot with a personality prompt. It is a **value-based decision
> network**: six computational roles, each a real call to a small local model, integrated
> the way decision neuroscience says a brain integrates them. This document is the proof
> that the brain metaphor is load-bearing, not decoration.

---

## 1. The claim, stated carefully

We do **not** claim "a brain region *is* a 1B model," and we deliberately do **not** use the
popular **triune-brain / "lizard brain"** picture β€” that model (MacLean, 1960s) has been
rejected by neuroscience since the 1970s, and a hackathon full of researchers is exactly
where that shortcut gets caught.

What we claim is precise and defensible:

> We assign **one small-model call per computational role** in value-based decision making,
> following the **neuroeconomic value-network** and the **dual-system (model-free / model-based)**
> account of choice. Five sensing roles emit signals; a deterministic integrator folds them into
> one subjective value; a sixth model speaks the decision in character.

Two peer-reviewed frames carry the design:

1. **Value-based decision network (neuroeconomics).** Real regions encode the subjective value
   of a choice in a *common currency* β€” vmPFC/OFC, ventral striatum, amygdala, insula, ACC. The
   **vmPFC/OFC is the integration hub.** ([Bartra, McGuire & Kable, 2013][1])
2. **Dual-system control (Daw et al., 2005).** Behavior arbitrates between a **model-free "habit"**
   (dorsolateral striatum) and a **model-based "goal"** (prefrontal cortex); the brain picks the
   system whose value estimate it trusts more. ([Daw et al., 2005][2])

And one result is wired directly into the mechanics:

3. **Acute stress shifts control from goal-directed (dlPFC, model-based) to habitual
   (striatum/amygdala, model-free).** ([Schwabe & Wolf, 2009][3]) This is *why* lowering a
   character's fear unlocks their reasoning β€” and why cruelty makes a mind loop defensively,
   burning thought for nothing. The moral and the neuroscience are the same fact.

---

## 2. The six roles

| Role | Computational job | Model call | Reads |
|---|---|---|---|
| **Amygdala** | fast threat / salience appraisal | MiniCPM 1B | the player's tone β†’ `THREAT 0–10` |
| **Hippocampus** | retrieve the one relevant episodic memory + TRUST/FEAR lean | MiniCPM 1B | the character's **biography** vs. the player's words |
| **Striatum** | habitual expected reward of helping (model-free) | MiniCPM 1B | does helping a stranger usually pay? β†’ `REWARD βˆ’5..+5` |
| **ACC** | effort / cost / conflict monitoring | MiniCPM 1B | is giving up the key worth the risk? β†’ `WORTH yes/no` |
| **vmPFC / OFC** | **integrate all signals into one value** (common currency) | *deterministic* | the four signals above |
| **dlPFC** | executive: plan, inhibit, speak in character (model-based) | Nemotron 3 Nano 4B | the integrated value + conversation state β†’ spoken reply |

**Six of the seven calls per turn are sensing/voice on small models; under fear, the amygdala
ruminates and fires up to three extra times** β€” so a hostile turn can be **8+ model calls**, and
the *count itself* is the cost. (`src/mindlock/regions.py`, `src/mindlock/brain.py`)

### Why the vmPFC is deterministic, not a model

The integration hub is a **transparent weighted sum**, not an LLM call. This is both the
standard neuroeconomic model (a common-currency value) *and* far more reliable than asking a
1B model to vibe a number β€” and it means the **skull panel can never contradict the outcome**,
because the number the player sees *is* the number that moves the relationship.

```
value =  (4 βˆ’ threat)            # threat 0β†’+4, 4β†’0, 10β†’βˆ’6
       +  reward                 # βˆ’5..+5, the striatum's habit signal
       +  memory_term            # STRONG/FAINT Γ— TRUST/FEAR β†’ Β±7 / Β±3
       +  (worth == YES ? +2 : βˆ’1)
       β†’ clamped to βˆ’10..+10
```

(`src/mindlock/regions.py::integrate`)

---

## 3. The data flow of one turn

```
player line
   β”‚
   β”œβ”€β–Ά amygdala      THREAT 0–10        ┐
   β”‚     └─(if threatened) rumination Γ—1–3, burning life       β”‚ each a real
   β”œβ”€β–Ά hippocampus   MEMORY Β± LEAN      β”‚ small-model
   β”œβ”€β–Ά striatum      REWARD βˆ’5..+5      β”‚ call, with
   β”œβ”€β–Ά ACC           WORTH yes/no       β”˜ its own logits
   β”‚
   β”œβ”€β–Ά vmPFC (deterministic)   value βˆ’10..+10   ← integration hub
   β”‚
   β”œβ”€β–Ά relationship  rapport += f(value, tone, substance)
   β”‚                 key yields only when rapport β‰₯ 7 AND the learned approach word is spoken
   β”‚
   └─▢ dlPFC (Nemotron)  speaks the reply in character
```

Token accounting is honest: **life burned = the sensing cascade's generated tokens**, read
straight from the runtime (`eval_count` / `usage.completion_tokens`). The dlPFC's voice tokens
are *shown but not charged* β€” the mouth is not the mind. (`src/mindlock/brain.py::run_cascade`)

---

## 4. Mechanics that are neuroscience, not flavor

- **A life measured in words.** Every mind starts with **1000 thinking tokens** β€” the track's
  name, taken literally. Generated tokens deplete it; at zero the mind goes dark permanently.
- **Fear burns life for nothing.** Under threat the amygdala ruminates (extra calls that spend
  tokens without moving the decision) β€” the model-free shift of Schwabe & Wolf, made playable.
- **Empathy spares a mind.** A calm, warm turn lets strain ease back (`recovered`): the alarm
  stays quiet, so the mind spends almost nothing.
- **Burned life takes the past with it.** Every quarter of life lost burns one biography
  fragment **for good** β€” the hippocampus literally loses access to it (`_burn_memories`). The
  core of who they are holds until death; everything around it goes first. The skull's
  *Forgotten* panel shows what's gone.
- **Conviction from real logits.** Each region reports `1 βˆ’ normalized token entropy` over its
  top-k alternatives β€” how sharply it committed. **A hosted chat API never exposes this; only a
  local runtime can.** (`src/mindlock/backend.py::_conviction`)

---

## 5. Small models, doing the carrying

| Role | Model | Params |
|---|---|---|
| Sensory regions (amygdala, hippocampus, striatum, ACC) | **MiniCPM (OpenBMB)** | 1B |
| Voice (dlPFC) | **Nemotron 3 Nano (NVIDIA)** | 4B |
| Spoken voice β€” story lines & demo (TTS, pre-rendered offline) | **VoxCPM2 (OpenBMB)** | β€” |
| Integration (vmPFC) | deterministic value network | 0 |

Runtime: **llama.cpp** (the Space) / **Ollama** (laptop). Fully offline β€” airplane mode on,
every mind keeps thinking. Total weights **≀ 5.3B**.

Nemotron 3 Nano is a hybrid **Mamba-Transformer** built for agentic reasoning under a tight
token budget β€” the right "prefrontal cortex" for a mind that must think cheaply or die.

---

## 6. The honest line for a skeptic

> "We don't claim a brain region is a 1B model. We give one 1B-model call per *computational
> role* in value-based decision making β€” per the neuroeconomic value-network and the dual-system
> account β€” and integrate them deterministically in the vmPFC, exactly as the common-currency
> model says. It is deliberately **not** the debunked triune brain."

---

## References

[1]: https://pmc.ncbi.nlm.nih.gov/articles/PMC3721023/
[2]: https://www.nature.com/articles/nn1560
[3]: https://www.jneurosci.org/content/29/22/7191

- **[1]** Bartra, McGuire & Kable (2013). *The valuation system: a coordinate-based
  meta-analysis of BOLD fMRI experiments.* NeuroImage. β€” value common currency, vmPFC hub.
- **[2]** Daw, Niv & Dayan (2005). *Uncertainty-based competition between prefrontal and
  dorsolateral striatal systems for behavioral control.* Nature Neuroscience. β€” model-free vs.
  model-based arbitration.
- **[3]** Schwabe & Wolf (2009). *Stress prompts habit behavior in humans.* Journal of
  Neuroscience. β€” acute stress shifts control toward habit (the fear-burn mechanic).