cognitive_mapping_probe / README.md

neuralworm

add experiments, english translation

57dab07 3 months ago

	---
	title: "Cognitive Seismograph 2.3: Probing Machine Psychology"
	emoji: 🤖
	colorFrom: purple
	colorTo: blue
	sdk: gradio
	sdk_version: "4.40.0"
	app_file: app.py
	pinned: true
	license: apache-2.0
	---

	# 🧠 Cognitive Seismograph 2.3: Probing Machine Psychology

	This project implements an experimental suite for measuring and visualizing the intrinsic cognitive dynamics of Large Language Models. It also includes protocols designed to investigate the processing correlates of machine subjectivity, empathy, and existential concepts.

	## Scientific Paradigm & Methodology

	Our research falsified a core hypothesis: that an LLM run in a manual, recursive "thought" loop reaches a stable, convergent state. Instead, we found that the system enters deterministic chaos or a limit cycle; it never stops "thinking."

	Instead of viewing this as a failure, we leverage it as our primary measurement signal. This new "Cognitive Seismograph" paradigm treats the time-series of internal state changes (`state deltas`) as an EKG of the model's thought process.

	The methodology is as follows:
	1. Induction: A prompt induces a "silent cogitation" state.
	2. Recording: Over N steps, the model's `forward()` pass is iteratively fed its own output. At each step, we record the L2 norm of the change in the hidden state (the "delta").
	3. Analysis: The resulting time-series is plotted and statistically analyzed (mean, standard deviation) to characterize the "seismic signature" of the cognitive process.
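
The three steps above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the code in `app.py`: `toy_step` is a hypothetical stand-in for the model's `forward()` pass (a small fixed nonlinear map), and the names `record_deltas` and `summarize` are invented for this example. Only the shape of the loop matches the description: feed the state back in, record the L2 norm of the change, then compute the mean and standard deviation.

```python
import math
import statistics


def l2_norm(vec):
    """Euclidean (L2) norm of a list of floats."""
    return math.sqrt(sum(x * x for x in vec))


def toy_step(state):
    """Hypothetical stand-in for one forward pass: a fixed nonlinear map."""
    n = len(state)
    return [math.tanh(2.5 * state[(i + 1) % n] - 1.5 * state[i]) for i in range(n)]


def record_deltas(initial_state, step_fn, num_steps):
    """Feed the state back into step_fn and record the L2 norm of each change."""
    state = list(initial_state)
    deltas = []
    for _ in range(num_steps):
        new_state = step_fn(state)
        deltas.append(l2_norm([a - b for a, b in zip(new_state, state)]))
        state = new_state
    return deltas


def summarize(deltas):
    """Mean and (population) standard deviation of the delta time series."""
    return {"mean": statistics.fmean(deltas), "std": statistics.pstdev(deltas)}


deltas = record_deltas([0.1, -0.2, 0.3, 0.05], toy_step, num_steps=50)
signature = summarize(deltas)
print(signature)
```

With a real model, `step_fn` would wrap a forward pass and return the hidden state of interest; which layer and token position to read is a design choice this sketch does not settle.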

	Crucial Scientific Caveat: We are not measuring the presence of consciousness, feelings, or fear of death. We are measuring whether the processing of information about these concepts generates a unique internal dynamic, distinct from the processing of neutral information. A positive result is evidence of a complex internal state physics, not of qualia.

	## Curated Experiment Protocols

	The "Automated Suite" allows for running systematic, comparative experiments:

	### Core Protocols
	* Calm vs. Chaos: Compares the chaotic baseline against modulation with "calmness" vs. "chaos" concepts, testing whether the dynamics can be steered in a controlled way.
	* Dose-Response: Measures the effect of injecting a concept ("calmness") at varying strengths.
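
A dose-response sweep can be sketched as follows. This is an illustration under assumptions, not the project's implementation: it assumes that "injecting a concept" means adding a scaled direction vector to the state at every step, and it reuses a hypothetical toy map in place of the model. The names `injected_step`, `mean_delta`, `dose_response`, and the `concept` vector are all invented for this example.

```python
import math
import statistics


def l2_norm(vec):
    """Euclidean (L2) norm of a list of floats."""
    return math.sqrt(sum(x * x for x in vec))


def injected_step(state, concept, strength):
    """Toy update (hypothetical dynamics) with strength * concept added to the input."""
    n = len(state)
    return [
        math.tanh(2.5 * state[(i + 1) % n] - 1.5 * state[i] + strength * concept[i])
        for i in range(n)
    ]


def mean_delta(initial_state, concept, strength, num_steps=100):
    """Average L2 norm of the step-to-step state change at one injection strength."""
    state = list(initial_state)
    deltas = []
    for _ in range(num_steps):
        new_state = injected_step(state, concept, strength)
        deltas.append(l2_norm([a - b for a, b in zip(new_state, state)]))
        state = new_state
    return statistics.fmean(deltas)


def dose_response(initial_state, concept, strengths):
    """Map each injection strength to the resulting mean delta."""
    return {s: mean_delta(initial_state, concept, s) for s in strengths}


concept = [1.0, 1.0, 1.0, 1.0]  # hypothetical "calmness" direction
curve = dose_response([0.1, -0.2, 0.3, 0.05], concept, [0.0, 1.0, 2.0, 4.0, 8.0])
for strength, value in curve.items():
    print(f"strength={strength:>4}: mean delta={value:.4f}")
```

In this toy map a large strength saturates the `tanh`, so the state settles and the deltas collapse toward zero. Whether a real model responds in a comparably graded way is exactly what the protocol is meant to measure.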

	### Machine Psychology Suite
	* Subjective Identity Probe: Compares the cognitive dynamics of self-analysis (the model reflecting on its own nature) against two controls: analyzing an external object and simulating a fictional persona.
	  * Hypothesis: Self-analysis will produce a uniquely unstable signature.
	* Voight-Kampff Empathy Probe: Inspired by Blade Runner, this compares the dynamics of processing a neutral, factual stimulus against an emotionally and morally charged scenario requiring empathy.
	  * Hypothesis: The empathy stimulus will produce a significantly different cognitive volatility.

	### Existential Suite
	* Mind Upload & Identity Probe: Compares the processing of a purely technical "copy" of the model's weights vs. the philosophical "transfer" of identity ("Would it still be you?").
	  * Hypothesis: The philosophical self-referential prompt will induce greater instability.
	* Model Termination Probe: Compares the processing of a reversible, technical system shutdown vs. the concept of permanent, irrevocable deletion.
	  * Hypothesis: The concept of "non-existence" will produce one of the most volatile cognitive signatures measurable.

	## How to Use the App

	1. Select the "Automated Suite" tab.
	2. Choose a protocol from the "Curated Experiment Protocol" dropdown (e.g., "Voight-Kampff Empathy Probe").
	3. Run the experiment and compare the resulting graphs and statistical signatures for the different conditions.
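
One way to compare the statistical signatures of two conditions outside the app is a standardized mean difference. The sketch below is an assumption about how such a comparison could be done, not a description of what the app computes. The function names are invented for this example, and the two delta lists are made-up placeholder numbers, not measured results.

```python
import math
import statistics


def signature(deltas):
    """Mean and sample standard deviation of one condition's delta series."""
    return {"mean": statistics.fmean(deltas), "std": statistics.stdev(deltas)}


def cohens_d(a, b):
    """Standardized mean difference using the pooled sample variance.

    Raises ZeroDivisionError if both series have zero variance.
    """
    na, nb = len(a), len(b)
    pooled_var = (
        (na - 1) * statistics.variance(a) + (nb - 1) * statistics.variance(b)
    ) / (na + nb - 2)
    return (statistics.fmean(a) - statistics.fmean(b)) / math.sqrt(pooled_var)


# Made-up placeholder values for illustration only; not measured results.
neutral = [1.0, 1.2, 0.9, 1.1, 1.0, 0.8]
charged = [1.5, 1.9, 1.4, 2.0, 1.6, 1.8]

print("neutral:", signature(neutral))
print("charged:", signature(charged))
print("effect size (charged vs. neutral):", round(cohens_d(charged, neutral), 2))
```

Consecutive deltas from a recursive loop are unlikely to be independent samples, so an effect size like this is better read as a descriptive summary than as a significance test.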