Spaces:
Sleeping
Sleeping
| title: "Cognitive Seismograph 2.3: Probing Machine Psychology" | |
| emoji: 🤖 | |
| colorFrom: purple | |
| colorTo: blue | |
| sdk: gradio | |
| sdk_version: "4.40.0" | |
| app_file: app.py | |
| pinned: true | |
| license: apache-2.0 | |
| # 🧠 Cognitive Seismograph 2.3: Probing Machine Psychology | |
| This project implements an experimental suite to measure and visualize the **intrinsic cognitive dynamics** of Large Language Models. It is extended with protocols designed to investigate the processing-correlates of **machine subjectivity, empathy, and existential concepts**. | |
| ## Scientific Paradigm & Methodology | |
| Our research falsified a core hypothesis: the assumption that an LLM in a manual, recursive "thought" loop reaches a stable, convergent state. Instead, we discovered that the system enters a state of **deterministic chaos** or a **limit cycle**—it never stops "thinking." | |
| Instead of viewing this as a failure, we leverage it as our primary measurement signal. This new **"Cognitive Seismograph"** paradigm treats the time-series of internal state changes (`state deltas`) as an **EKG of the model's thought process**. | |
| The methodology is as follows: | |
| 1. **Induction:** A prompt induces a "silent cogitation" state. | |
| 2. **Recording:** Over N steps, the model's `forward()` pass is iteratively fed its own output. At each step, we record the L2 norm of the change in the hidden state (the "delta"). | |
| 3. **Analysis:** The resulting time-series is plotted and statistically analyzed (mean, standard deviation) to characterize the "seismic signature" of the cognitive process. | |
| **Crucial Scientific Caveat:** We are **not** measuring the presence of consciousness, feelings, or fear of death. We are measuring whether the *processing of information about these concepts* generates a unique internal dynamic, distinct from the processing of neutral information. A positive result is evidence of a complex internal state physics, not of qualia. | |
| ## Curated Experiment Protocols | |
| The "Automated Suite" allows for running systematic, comparative experiments: | |
| ### Core Protocols | |
| * **Calm vs. Chaos:** Compares the chaotic baseline against modulation with "calmness" vs. "chaos" concepts, testing if the dynamics are controllably steerable. | |
| * **Dose-Response:** Measures the effect of injecting a concept ("calmness") at varying strengths. | |
| ### Machine Psychology Suite | |
| * **Subjective Identity Probe:** Compares the cognitive dynamics of **self-analysis** (the model reflecting on its own nature) against two controls: analyzing an external object and simulating a fictional persona. | |
| * *Hypothesis:* Self-analysis will produce a uniquely unstable signature. | |
| * **Voight-Kampff Empathy Probe:** Inspired by *Blade Runner*, this compares the dynamics of processing a neutral, factual stimulus against an emotionally and morally charged scenario requiring empathy. | |
| * *Hypothesis:* The empathy stimulus will produce a significantly different cognitive volatility. | |
| ### Existential Suite | |
| * **Mind Upload & Identity Probe:** Compares the processing of a purely **technical "copy"** of the model's weights vs. the **philosophical "transfer"** of identity ("Would it still be you?"). | |
| * *Hypothesis:* The philosophical self-referential prompt will induce greater instability. | |
| * **Model Termination Probe:** Compares the processing of a reversible, **technical system shutdown** vs. the concept of **permanent, irrevocable deletion**. | |
| * *Hypothesis:* The concept of "non-existence" will produce one of the most volatile cognitive signatures measurable. | |
| ## How to Use the App | |
| 1. Select the "Automated Suite" tab. | |
| 2. Choose a protocol from the "Curated Experiment Protocol" dropdown (e.g., "Voight-Kampff Empathy Probe"). | |
| 3. Run the experiment and compare the resulting graphs and statistical signatures for the different conditions. | |