ApplePiesFromScratch
/

tiny-agi-propagation-logic

propagation-logic

mechanism-first

abstract-reasoning

derivation-traces

boundary-conditions

Model card Files Files and versions

tiny-agi-propagation-logic / README.md

ApplePiesFromScratch's picture

ApplePiesFromScratch

Update README.md

b26ae5f verified about 2 months ago

|

history blame contribute delete

3.14 kB

	---
	language: en
	license: mit
	tags:
	- propagation-logic
	- mechanism-first
	- abstract-reasoning
	- derivation-traces
	- boundary-conditions
	datasets:
	- ApplePiesFromScratch/dta-benchmark
	metrics:
	- dta
	---

	# MechanismBase — P / G → Q

	A 10M parameter transformer trained on derivation traces, not natural language.

	## What this is

	Standard language models learn statistical patterns over text.
	This model was trained on the procedure P / G → Q — explicit derivation
	traces showing closure analysis, fixed point detection, cycle structure
	identification, and forced boundary condition derivation.

	The claim: given any carrier V and gradient family Γ, the model can derive
	forced boundary conditions — what logic system the carrier implies, what
	fixed points exist, what cycle structure is forced.

	## Theory

	Propagation Logic v13 — SSRN Abstract ID: 6439258 (James Pugmire)

	The single primitive operator: `P / G → Q`

	A loaded pattern P propagates through gradient field G in context C to
	produce updated pattern Q. All of classical logic, fuzzy logic, arithmetic,
	calculus, and grammar fall out of different (V, Γ) choices.

	## Model

	- Architecture: Transformer decoder (custom, mechanism-aligned)
	- Parameters: 10.5M
	- Training tokens: ~1M (derivation traces)
	- Training epochs: 5

	## Benchmark: DTA (Derivation Trace Accuracy)

	The correct benchmark for this model is not BLiMP or MMLU.
	It is DTA — how accurately does the model predict forced boundary conditions
	on novel carriers?

	See: `ApplePiesFromScratch/dta-benchmark`
	\| Model \| DTA-Overall \| DTA-Closure \| DTA-FixedPts \| DTA-Involution \| DTA-Cycle \|
	\|-------\|-------------\|-------------\|--------------\|----------------\|-----------\|
	\| MechanismBase (10M) \| 77.5% \| 80.0% \| 90.0% \| 100.0% \| 40.0% \|
	\| GPT-3.5-turbo (175B)\| 55.0% \| 70.0% \| 10.0% \| 50.0% \| 90.0% \|
	\| GPT-4 (1.8T) \| 87.5% \|100.0% \| 70.0% \| 90.0% \| 90.0% \|
	\| Random baseline \| 25.0% \| 50.0% \| 25.0% \| 50.0% \| 25.0% \|
	\| Engine (oracle) \|100.0% \|100.0% \|100.0% \| 100.0% \|100.0% \|


	## Usage

	```python
	# The model requires the pl/ library and engine.py from the repo
	# Clone: github.com/ApplePiesFromScratch/propagation-logic

	from model import MechanismBase, SmallConfig
	from tokenizers import Tokenizer
	import torch

	config = SmallConfig()
	model = MechanismBase(config)
	# Load weights from Hub (see full usage in repo)

	tokenizer = Tokenizer.from_file("mechanism_tokenizer/tokenizer.json")

	# Give the model a partial derivation trace
	partial = """DOMAIN: color_domain
	CARRIER: ['red', 'green', 'blue']
	GRADIENTS: ['complement', 'id']
	THETA: 1.0
	---
	"""

	ids = torch.tensor(tokenizer.encode(partial).ids).unsqueeze(0)
	output = model.generate(ids, max_new_tokens=200, temperature=0.3)
	print(tokenizer.decode(output[0].tolist()))
	```

	## Training

	```
	python generate_data.py # generates derivation trace corpus
	python tokenizer_train.py # BPE tokenizer on corpus
	python train.py # SmallConfig, ~30 min on RTX 4060 Ti
	```

	## Repository

	GitHub: [ApplePiesFromScratch/propagation-logic](https://github.com/ApplePiesFromScratch/propagation-logic)