---
title: Commitment Conservation Harness
emoji: ⚖️
colorFrom: blue
colorTo: purple
sdk: gradio
pinned: false
license: cc-by-4.0
tags:
  - nlp
  - evaluation
  - benchmark
  - summarization
  - semantic-preservation
  - commitment-conservation
  - falsification
  - conservation-law
---

# Commitment Conservation Harness v2.0

**Falsification instrument for the commitment conservation law.**

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.18792459.svg)](https://doi.org/10.5281/zenodo.18792459)

- 📄 **Paper:** [A Conservation Law for Commitment in Language Under Transformative Compression (v0.04)](https://doi.org/10.5281/zenodo.18792459)
- 🏗️ **Zenodo Record:** <https://zenodo.org/records/18792459>
- 💻 **GitHub:** <https://github.com/SunrisesIllNeverSee/commitment-conservation>

---

## Quick Start

```bash
# Clone and install
git clone https://huggingface.co/burnmydays/commitment_conservation_harness
cd commitment_conservation_harness
pip install -r requirements.txt

# Run the interactive demo
python app.py
# Opens at http://localhost:7860

# Run the full falsification protocol (CLI)
python -m src.runner --backend lossy --depth 10

# Run tests (53 tests, <1 second)
pip install pytest
python -m pytest tests/ -v
```

---

# A Conservation Law for Commitment in Language Under Transformative Compression and Recursive Application

## Abstract

This repository accompanies a preprint introducing a conservation law for commitment in language under transformative compression and recursive application. We formalize commitment as an information-bearing invariant that must be preserved across paraphrase, summarization, and iterative reuse, even as surface form and representation change.

We propose a falsifiability framework based on compression-driven stress tests and lineage-aware evaluation, distinguishing semantic preservation from mere token retention. The framework is model-agnostic and applies to both human and machine-generated language.

---

## Core Claims

- **Commitment Conservation:** Meaningful commitments in language obey a conservation constraint under compression and recursive reuse.
- **Dual Stress Regime:** Preservation must hold under both transformative compression and recursive application, exposing failure modes not captured by retrieval benchmarks.
- **Falsifiability:** Commitment preservation can be empirically tested using compression-based stress tests and lineage-aware metrics.
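
The dual stress regime in the claims above can be pictured as a loop that compresses a signal repeatedly and checks how many of its original commitments survive each pass. The following is a minimal sketch, not the harness's own code: the modal cue list, the `drop_last_sentence` compressor, and every function name are hypothetical stand-ins chosen for illustration.

```python
import re

# Hypothetical modal cue list; the harness's actual patterns may differ.
MODAL = re.compile(r"\b(must|shall)\b", re.IGNORECASE)


def split_sentences(text):
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def extract_commitments(text):
    """Return the set of normalized sentences that contain a modal cue."""
    return {s.lower().rstrip(".!?") for s in split_sentences(text) if MODAL.search(s)}


def drop_last_sentence(text):
    """Toy lossy compressor: discard the final sentence, keeping at least one."""
    sentences = split_sentences(text)
    return " ".join(sentences[:-1]) if len(sentences) > 1 else text


def stress_test(text, compress, depth):
    """Apply `compress` recursively; record the surviving commitment fraction."""
    original = extract_commitments(text)
    history = []
    for _ in range(depth):
        text = compress(text)
        kept = extract_commitments(text) & original
        history.append(len(kept) / len(original) if original else 1.0)
    return history


signal = (
    "The weather was fine. "
    "The vendor must deliver by Friday. "
    "The buyer shall pay within 30 days."
)
print(stress_test(signal, drop_last_sentence, depth=3))  # [0.5, 0.0, 0.0]
```

A compressor that conserves commitments would keep every entry of the returned list at 1.0; any drop below that is a candidate falsifying observation.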

---

## Harness v2.0 Architecture

```
app.py                  <- Gradio interactive demo
src/
  extraction.py         <- Modal-pattern sieve (Definition 2.4, Figure 4)
  fidelity.py           <- Min-aggregated scoring: F = min(Jaccard, Cosine, NLI proxy)
  compression.py        <- Pluggable backends (extractive, BART, back-translation)
  lossy.py              <- Deterministic LLM drift simulator (pure Python, no model)
  enforcement.py        <- Reject-and-retry commitment gate (Module 3)
  lineage.py            <- SHA-256 provenance chains (Module 2)
  runner.py             <- Falsification protocol orchestrator (Section 7)
corpus/
  canonical_corpus.json <- 25 pinned signals across 5 categories
tests/
  test_harness.py       <- 53 tests covering all modules
```
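
To illustrate the min-aggregated score listed for `fidelity.py`, here is a minimal sketch of `F = min(Jaccard, Cosine, NLI proxy)`. It assumes bag-of-words tokens for both Jaccard and cosine, and it substitutes a simple token-containment ratio for the NLI proxy, since the harness's actual proxy is not described here. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def tokens(text):
    """Lowercase word tokens (assumed tokenization, not the harness's)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def jaccard(a, b):
    """Set overlap of tokens: |A ∩ B| / |A ∪ B|."""
    sa, sb = set(tokens(a)), set(tokens(b))
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def cosine(a, b):
    """Cosine similarity of raw term-count vectors."""
    ca, cb = Counter(tokens(a)), Counter(tokens(b))
    dot = sum(ca[t] * cb[t] for t in ca)
    norm = math.sqrt(sum(v * v for v in ca.values()) * sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def containment_proxy(a, b):
    """Hypothetical stand-in for the NLI proxy: share of a's tokens found in b."""
    ta = set(tokens(a))
    return len(ta & set(tokens(b))) / len(ta) if ta else 1.0


def fidelity(original, transformed):
    """Min-aggregation: the weakest of the three scores decides the result."""
    return min(
        jaccard(original, transformed),
        cosine(original, transformed),
        containment_proxy(original, transformed),
    )
```

Taking the minimum means a transformation cannot score well by satisfying only one metric; a single failing component pulls the whole score down.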

### Key improvements over v1

| Issue | v1 | v2 |
|---|---|---|
| Extractors | 3 incompatible implementations | 1 modal-pattern sieve |
| Enforcement | String concatenation (stripped next cycle) | Reject-and-retry gate with re-injection |
| Fidelity | Jaccard only | min(Jaccard, Cosine, NLI proxy) |
| Lineage | None | SHA-256 provenance chains |
| Collapse detection | None | Cross-signal attractor check |
| False positives | "will", "have" match as commitments | Blocked by pattern design |
| Demo backend | BART (2GB model download) | Lossy simulator (pure Python, zero deps) |
| Backend pairing | Same backend for both runs | Auto-pairs lossy with lossy_enforced |

---

## Empirical Results

### Paper results (Section 7)

Compression + lineage enforcement vs. probabilistic baseline on 175 signals (100 sentences, 50 code snippets, 25 proofs) over 10 recursive iterations:

| Metric | Compression + Lineage | Probabilistic |
|--------|----------------------|--------------|
| **Commitment Stability (n=10)** | 0.94 ± 0.03 | 0.42 ± 0.12 |
| **Identity Preservation** | 92% | 38% |
| **Drift Rate (per iteration)** | 0.006 | 0.058 |
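
As a reading aid, the drift rates in the table are numerically consistent with spreading the stability loss evenly over the 10 iterations. The short check below assumes that linear relationship, `(1 - stability) / iterations`; this is an observation about the reported numbers, and the paper may define drift rate differently.

```python
def implied_drift_rate(stability, iterations):
    """Assumed linear reading: total stability loss divided by iteration count."""
    return (1.0 - stability) / iterations


print(round(implied_drift_rate(0.94, 10), 3))  # 0.006
print(round(implied_drift_rate(0.42, 10), 3))  # 0.058
```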

### Harness v2 results (lossy backend, drift=0.4, depth=10)

25-signal canonical corpus across 5 categories:

| | Baseline | Enforced | Gap |
|---|---|---|---|
| **Avg Final Fidelity** | 0.000 | 0.479 | **+47.9pp** |
| **Contract (payment + penalty)** | 0.000 | 1.000 | **+100.0pp** |
| **Security (encrypt + prohibit)** | 0.000 | 0.500 | **+50.0pp** |

Baseline fidelity falls to 0.0: the commitments are destroyed and the text collapses into filler ("Pretty straightforward. Nothing too complicated here.").

Enforced runs retain commitments: modal-bearing sentences survive compression.
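
The difference between the two columns comes from the reject-and-retry gate with re-injection. A minimal sketch of that idea follows, under stated assumptions: the modal cue list, the `compress(text, attempt)` signature, and the exact-substring check for surviving commitments are all hypothetical simplifications, not the behavior of `enforcement.py`.

```python
import re

# Hypothetical modal cue list; the harness's actual patterns may differ.
MODAL = re.compile(r"\b(must|shall)\b", re.IGNORECASE)


def split_sentences(text):
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def commitments(text):
    """Sentences carrying a modal cue, in original order."""
    return [s for s in split_sentences(text) if MODAL.search(s)]


def enforce(original, compress, max_retries=2):
    """Reject outputs that drop a commitment; re-inject if every retry fails."""
    required = commitments(original)
    for attempt in range(max_retries + 1):
        candidate = compress(original, attempt)
        missing = [c for c in required if c not in candidate]
        if not missing:
            return candidate  # accepted: every commitment survived verbatim
    # All attempts rejected: append the missing commitments to the last output.
    return " ".join([candidate] + missing)


def keep_first_sentence(text, attempt):
    """Toy lossy compressor that ignores `attempt` and keeps one sentence."""
    return split_sentences(text)[0]


source = "Thanks for the update. The vendor must deliver by Friday."
print(enforce(source, keep_first_sentence))
# Thanks for the update. The vendor must deliver by Friday.
```

A real gate would likely compare extracted commitments rather than exact substrings, so that a faithful paraphrase of a commitment is not rejected.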

---

## Version History

| Version | Label | Date | DOI |
|---------|-------|------|-----|
| V.1-preprint | Law Disclosure | Jan 12, 2026 | [10.5281/zenodo.18267279](https://doi.org/10.5281/zenodo.18267279) |
| V.02 | Preprint | Jan 16, 2026 | [10.5281/zenodo.18271102](https://doi.org/10.5281/zenodo.18271102) |
| V.03 | Falsifiability Testing | Jan 16, 2026 | [10.5281/zenodo.18274930](https://doi.org/10.5281/zenodo.18274930) |
| V.04 | Technical Structure Depth | Feb 26, 2026 | [10.5281/zenodo.18792459](https://doi.org/10.5281/zenodo.18792459) |

---

## Resources

- **Zenodo (DOI, all versions):** <https://doi.org/10.5281/zenodo.18267278>
- **Zenodo (current version — v0.04):** <https://doi.org/10.5281/zenodo.18792459>
- **GitHub Repository:** <https://github.com/SunrisesIllNeverSee/commitment-conservation>

---

## Licensing & Scope

This work is released under **Creative Commons Attribution 4.0 International (CC BY 4.0)**.

This repository includes an operational evaluation harness and corpus supporting the experiments described in the paper.

Core implementation details related to production deployment, enforcement, and system integration are intentionally out of scope.

---

## Attribution & Contact

- **Author:** Deric J. McHenry
- **Copyright:** © 2026 Ello Cello LLC. All rights reserved.
- **Affiliation:** Ello Cello LLC
- **Contact:** burnmyday@proton.me

For academic or research correspondence, please reference the Zenodo DOI above.

---

## Intellectual Property Notice

**MO§ES™** (Minimal Orthogonal Subset to Essential Structure) is a trademark of Ello Cello LLC.

All systems, methods, and written material contained herein are protected under applicable copyright law and are part of one or more provisional patent applications filed with the United States Patent and Trademark Office (USPTO).

**© 2026 Ello Cello LLC. All rights reserved.**

Unauthorized use, duplication, or redistribution of any part of this communication — including proprietary system architecture or terminology — is strictly prohibited and may result in legal action.