Spaces:
Running
mAIndlock β the engine
Every NPC is not a chatbot with a personality prompt. It is a value-based decision network: six computational roles, each a real call to a small local model, integrated the way decision neuroscience says a brain integrates them. This document is the proof that the brain metaphor is load-bearing, not decoration.
1. The claim, stated carefully
We do not claim "a brain region is a 1B model," and we deliberately do not use the popular triune-brain / "lizard brain" picture β that model (MacLean, 1960s) has been rejected by neuroscience since the 1970s, and a hackathon full of researchers is exactly where that shortcut gets caught.
What we claim is precise and defensible:
We assign one small-model call per computational role in value-based decision making, following the neuroeconomic value-network and the dual-system (model-free / model-based) account of choice. Five sensing roles emit signals; a deterministic integrator folds them into one subjective value; a sixth model speaks the decision in character.
Two peer-reviewed frames carry the design:
- Value-based decision network (neuroeconomics). Real regions encode the subjective value of a choice in a common currency β vmPFC/OFC, ventral striatum, amygdala, insula, ACC. The vmPFC/OFC is the integration hub. (Bartra, McGuire & Kable, 2013)
- Dual-system control (Daw et al., 2005). Behavior arbitrates between a model-free "habit" (dorsolateral striatum) and a model-based "goal" (prefrontal cortex); the brain picks the system whose value estimate it trusts more. (Daw et al., 2005)
And one result is wired directly into the mechanics:
- Acute stress shifts control from goal-directed (dlPFC, model-based) to habitual (striatum/amygdala, model-free). (Schwabe & Wolf, 2009) This is why lowering a character's fear unlocks their reasoning β and why cruelty makes a mind loop defensively, burning thought for nothing. The moral and the neuroscience are the same fact.
2. The six roles
| Role | Computational job | Model call | Reads |
|---|---|---|---|
| Amygdala | fast threat / salience appraisal | MiniCPM 1B | the player's tone β THREAT 0β10 |
| Hippocampus | retrieve the one relevant episodic memory + TRUST/FEAR lean | MiniCPM 1B | the character's biography vs. the player's words |
| Striatum | habitual expected reward of helping (model-free) | MiniCPM 1B | does helping a stranger usually pay? β REWARD β5..+5 |
| ACC | effort / cost / conflict monitoring | MiniCPM 1B | is giving up the key worth the risk? β WORTH yes/no |
| vmPFC / OFC | integrate all signals into one value (common currency) | deterministic | the four signals above |
| dlPFC | executive: plan, inhibit, speak in character (model-based) | Nemotron 3 Nano 4B | the integrated value + conversation state β spoken reply |
Six of the seven calls per turn are sensing/voice on small models; under fear, the amygdala
ruminates and fires up to three extra times β so a hostile turn can be 8+ model calls, and
the count itself is the cost. (src/mindlock/regions.py, src/mindlock/brain.py)
Why the vmPFC is deterministic, not a model
The integration hub is a transparent weighted sum, not an LLM call. This is both the standard neuroeconomic model (a common-currency value) and far more reliable than asking a 1B model to vibe a number β and it means the skull panel can never contradict the outcome, because the number the player sees is the number that moves the relationship.
value = (4 β threat) # threat 0β+4, 4β0, 10ββ6
+ reward # β5..+5, the striatum's habit signal
+ memory_term # STRONG/FAINT Γ TRUST/FEAR β Β±7 / Β±3
+ (worth == YES ? +2 : β1)
β clamped to β10..+10
(src/mindlock/regions.py::integrate)
3. The data flow of one turn
player line
β
βββΆ amygdala THREAT 0β10 β
β ββ(if threatened) rumination Γ1β3, burning life β each a real
βββΆ hippocampus MEMORY Β± LEAN β small-model
βββΆ striatum REWARD β5..+5 β call, with
βββΆ ACC WORTH yes/no β its own logits
β
βββΆ vmPFC (deterministic) value β10..+10 β integration hub
β
βββΆ relationship rapport += f(value, tone, substance)
β key yields only when rapport β₯ 7 AND the learned approach word is spoken
β
βββΆ dlPFC (Nemotron) speaks the reply in character
Token accounting is honest: life burned = the sensing cascade's generated tokens, read
straight from the runtime (eval_count / usage.completion_tokens). The dlPFC's voice tokens
are shown but not charged β the mouth is not the mind. (src/mindlock/brain.py::run_cascade)
4. Mechanics that are neuroscience, not flavor
- A life measured in words. Every mind starts with 1000 thinking tokens β the track's name, taken literally. Generated tokens deplete it; at zero the mind goes dark permanently.
- Fear burns life for nothing. Under threat the amygdala ruminates (extra calls that spend tokens without moving the decision) β the model-free shift of Schwabe & Wolf, made playable.
- Empathy spares a mind. A calm, warm turn lets strain ease back (
recovered): the alarm stays quiet, so the mind spends almost nothing. - Burned life takes the past with it. Every quarter of life lost burns one biography
fragment for good β the hippocampus literally loses access to it (
_burn_memories). The core of who they are holds until death; everything around it goes first. The skull's Forgotten panel shows what's gone. - Conviction from real logits. Each region reports
1 β normalized token entropyover its top-k alternatives β how sharply it committed. A hosted chat API never exposes this; only a local runtime can. (src/mindlock/backend.py::_conviction)
5. Small models, doing the carrying
| Role | Model | Params |
|---|---|---|
| Sensory regions (amygdala, hippocampus, striatum, ACC) | MiniCPM (OpenBMB) | 1B |
| Voice (dlPFC) | Nemotron 3 Nano (NVIDIA) | 4B |
| Spoken voice β story lines & demo (TTS, pre-rendered offline) | VoxCPM2 (OpenBMB) | β |
| Integration (vmPFC) | deterministic value network | 0 |
Runtime: llama.cpp (the Space) / Ollama (laptop). Fully offline β airplane mode on, every mind keeps thinking. Total weights β€ 5.3B.
Nemotron 3 Nano is a hybrid Mamba-Transformer built for agentic reasoning under a tight token budget β the right "prefrontal cortex" for a mind that must think cheaply or die.
6. The honest line for a skeptic
"We don't claim a brain region is a 1B model. We give one 1B-model call per computational role in value-based decision making β per the neuroeconomic value-network and the dual-system account β and integrate them deterministically in the vmPFC, exactly as the common-currency model says. It is deliberately not the debunked triune brain."
References
- 1 Bartra, McGuire & Kable (2013). The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments. NeuroImage. β value common currency, vmPFC hub.
- 2 Daw, Niv & Dayan (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience. β model-free vs. model-based arbitration.
- 3 Schwabe & Wolf (2009). Stress prompts habit behavior in humans. Journal of Neuroscience. β acute stress shifts control toward habit (the fear-burn mechanic).