CharlesCNorton committed
Commit
56aed4a
·
1 Parent(s): 1b10db6

Integrate LLM guide into README, remove llm folder


- Add LLM Integration section with full architecture docs
- Remove formal verification/Coq references
- Delete llm/ folder (content now in README)

Files changed (2)
  1. README.md +243 -67
  2. llm/core.py +0 -766
README.md CHANGED
@@ -17,8 +17,8 @@ tags:
17
  Every logic gate is a threshold neuron: `output = 1 if (Σ wᵢxᵢ + b) ≥ 0 else 0`
18
 
19
  ```
20
- Tensors: 6,296
21
- Parameters: 8,267,667
22
  ```
23
 
24
  ---
@@ -30,7 +30,7 @@ A complete 8-bit processor where every operation—from Boolean logic to arithme
30
  | Component | Specification |
31
  |-----------|---------------|
32
  | Registers | 4 × 8-bit general purpose |
33
- | Memory | 64KB addressable |
34
  | ALU | 16 operations (ADD, SUB, AND, OR, XOR, NOT, SHL, SHR, INC, DEC, CMP, NEG, PASS, ZERO, ONES, NOP) |
35
  | Flags | Zero, Negative, Carry, Overflow |
36
  | Control | JMP, JZ, JNZ, JC, JNC, JN, JP, JV, JNV, CALL, RET, PUSH, POP |
@@ -85,16 +85,16 @@ The weights in this repository implement a complete 8-bit computer: registers, A
85
  | Arithmetic | 18 | Half/full adder, 2/4/8-bit ripple carry, comparators |
86
  | ALU | 3 | 8-bit ALU, control decoder, flag computation |
87
  | Combinational | 10 | MUX (2:1, 4:1, 8:1), DEMUX, encoders, decoders |
88
- | Control Flow | 16 | JMP, conditional jumps, CALL, RET, PUSH, POP |
89
- | Error Detection | 11 | Parity (XOR tree), checksum, CRC, Hamming |
90
- | Modular | 11 | Divisibility by 2-12 (multi-layer for non-powers-of-2) |
91
- | Threshold | 13 | k-of-n gates, majority, minority, exactly-k |
92
- | Pattern | 10 | Popcount, leading/trailing ones, symmetry |
93
- | Memory | 3 | 16-bit addr decoder, 65536x8 read mux, write cell update (packed) |
94
 
95
  ---
96
 
97
- ## Usage
98
 
99
  ```python
100
  import torch
@@ -113,43 +113,43 @@ for a, b_in in [(0,0), (0,1), (1,0), (1,1)]:
113
  inp = torch.tensor([a, b_in], dtype=torch.float32)
114
  out = heaviside(inp @ w + b)
115
  print(f"AND({a}, {b_in}) = {int(out.item())}")
116
- ```
117
-
118
- ---
119
-
120
- ## State Tensor Layout
121
-
122
- All multi-bit fields are **MSB-first** (index 0 is the most-significant bit).
123
-
124
- ```
125
- [ PC[16] | IR[16] | R0[8] R1[8] R2[8] R3[8] | FLAGS[4] | SP[16] | CTRL[4] | MEM[65536][8] ]
126
- ```
127
-
128
- Flags are ordered as: `Z, N, C, V`.
129
-
130
- Control bits are ordered as: `HALT, MEM_WE, MEM_RE, RESERVED`.
131
-
132
- Total state size: `524376` bits.
133
-
134
- ---
135
-
136
- ## Instruction Encoding (16-bit)
137
-
138
- All instruction bits are **MSB-first**.
139
-
140
- ```
141
- 15..12 11..10 9..8 7..0
142
- opcode rd rs imm8
143
- ```
144
-
145
- Interpretation:
146
- - **R-type**: `rd = rd op rs` (imm8 ignored).
147
- - **I-type**: `rd = op rd, imm8` (rs ignored).
148
- - **Address-extended**: `LOAD`, `STORE`, `JMP`, `JZ`, `CALL` consume the next word as a 16-bit address (big-endian). `imm8` is reserved, and the PC skips 4 bytes when the jump is not taken.
149
-
150
- ---
151
-
152
- ## Verification
153
 
154
  The model includes `iron_eval.py` which exhaustively tests all circuits:
155
 
@@ -162,11 +162,11 @@ python iron_eval.py
162
 
163
  | Category | Status | Notes |
164
  |----------|--------|-------|
165
- | Boolean gates | Exhaustively tested | Coq proofs available |
166
- | Arithmetic | Exhaustively tested | Coq proofs available |
167
- | ALU | Exhaustively tested | Coq proofs available |
168
- | Control flow | Exhaustively tested | Coq proofs available |
169
- | Threshold | Exhaustively tested | Coq proofs available |
170
  | Modular (mod 3,5,6,7,9,10,11,12) | Exhaustively tested | Multi-layer, hand-constructed |
171
  | Parity | Exhaustively tested | XOR tree, hand-constructed |
172
  | Modular (mod 2,4,8) | Exhaustively tested | Single-layer, trivial |
@@ -184,15 +184,15 @@ All circuits pass exhaustive testing over their full input domains.
184
  ```
185
  {category}.{circuit}[.{layer}][.{component}].{weight|bias}
186
 
187
- Examples:
188
- boolean.and.weight
189
- boolean.xor.layer1.neuron1.weight
190
- arithmetic.ripplecarry8bit.fa7.ha2.sum.layer1.or.weight
191
- modular.mod5.layer2.eq3.weight
192
- error_detection.paritychecker8bit.stage2.xor1.layer1.nand.bias
193
-
194
- Memory circuits are stored as packed tensors to keep the safetensors header size manageable
195
- (e.g., `memory.addr_decode.weight`, `memory.read.and.weight`, `memory.write.and_old.weight`).
196
  ```
197
 
198
  ---
@@ -207,13 +207,185 @@ All weights are integers. All activations are Heaviside step. Designed for:
207
 
208
  ---
209
 
210
  ## Files
211
 
212
  | File | Description |
213
  |------|-------------|
214
- | `neural_computer.safetensors` | 6,296 tensors, 8,267,667 parameters |
215
- | `iron_eval.py` | Comprehensive test suite |
216
- | `prune_weights.py` | Weight optimization tool |
217
 
218
  ---
219
 
@@ -237,7 +409,11 @@ MIT
237
 
238
  ---
239
 
240
- ## Links
241
 
242
- - [Coq Proofs](https://github.com/CharlesCNorton/coq-circuits) Formal verification for core circuits
243
- - [HuggingFace](https://huggingface.co/phanerozoic) Other models
 
 
 
 
 
17
  Every logic gate is a threshold neuron: `output = 1 if (Σ wᵢxᵢ + b) ≥ 0 else 0`
18
 
19
  ```
20
+ Tensors: 6,296
21
+ Parameters: 8,267,667
22
  ```
23
 
24
  ---
 
30
  | Component | Specification |
31
  |-----------|---------------|
32
  | Registers | 4 × 8-bit general purpose |
33
+ | Memory | 64KB addressable |
34
  | ALU | 16 operations (ADD, SUB, AND, OR, XOR, NOT, SHL, SHR, INC, DEC, CMP, NEG, PASS, ZERO, ONES, NOP) |
35
  | Flags | Zero, Negative, Carry, Overflow |
36
  | Control | JMP, JZ, JNZ, JC, JNC, JN, JP, JV, JNV, CALL, RET, PUSH, POP |
 
85
  | Arithmetic | 18 | Half/full adder, 2/4/8-bit ripple carry, comparators |
86
  | ALU | 3 | 8-bit ALU, control decoder, flag computation |
87
  | Combinational | 10 | MUX (2:1, 4:1, 8:1), DEMUX, encoders, decoders |
88
+ | Control Flow | 16 | JMP, conditional jumps, CALL, RET, PUSH, POP |
89
+ | Error Detection | 11 | Parity (XOR tree), checksum, CRC, Hamming |
90
+ | Modular | 11 | Divisibility by 2-12 (multi-layer for non-powers-of-2) |
91
+ | Threshold | 13 | k-of-n gates, majority, minority, exactly-k |
92
+ | Pattern | 10 | Popcount, leading/trailing ones, symmetry |
93
+ | Memory | 3 | 16-bit addr decoder, 65536x8 read mux, write cell update (packed) |
94
 
95
  ---
96
 
97
+ ## Usage
98
 
99
  ```python
100
  import torch
 
113
  inp = torch.tensor([a, b_in], dtype=torch.float32)
114
  out = heaviside(inp @ w + b)
115
  print(f"AND({a}, {b_in}) = {int(out.item())}")
116
+ ```
117
+
118
+ ---
119
+
120
+ ## State Tensor Layout
121
+
122
+ All multi-bit fields are **MSB-first** (index 0 is the most-significant bit).
123
+
124
+ ```
125
+ [ PC[16] | IR[16] | R0[8] R1[8] R2[8] R3[8] | FLAGS[4] | SP[16] | CTRL[4] | MEM[65536][8] ]
126
+ ```
127
+
128
+ Flags are ordered as: `Z, N, C, V`.
129
+
130
+ Control bits are ordered as: `HALT, MEM_WE, MEM_RE, RESERVED`.
131
+
132
+ Total state size: `524376` bits.
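As a sanity check, the field widths in the layout above sum to the stated total:

```python
# Field widths in bits, taken from the layout above; MEM is 65536 cells of 8 bits.
fields = {"PC": 16, "IR": 16, "R0-R3": 4 * 8, "FLAGS": 4, "SP": 16, "CTRL": 4, "MEM": 65536 * 8}
total = sum(fields.values())
print(total)  # 524376
```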
133
+
134
+ ---
135
+
136
+ ## Instruction Encoding (16-bit)
137
+
138
+ All instruction bits are **MSB-first**.
139
+
140
+ ```
141
+ 15..12 11..10 9..8 7..0
142
+ opcode rd rs imm8
143
+ ```
144
+
145
+ Interpretation:
146
+ - **R-type**: `rd = rd op rs` (imm8 ignored).
147
+ - **I-type**: `rd = op rd, imm8` (rs ignored).
148
+ - **Address-extended**: `LOAD`, `STORE`, `JMP`, `JZ`, `CALL` consume the next word as a 16-bit address (big-endian). `imm8` is reserved, and the PC skips 4 bytes when the jump is not taken.
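The field split above can be expressed as a small decoder (a sketch; the function name and return shape are ours, not part of the repository):

```python
def decode(word: int):
    """Split a 16-bit instruction word into opcode/rd/rs/imm8 (MSB-first)."""
    opcode = (word >> 12) & 0xF   # bits 15..12
    rd = (word >> 10) & 0x3       # bits 11..10
    rs = (word >> 8) & 0x3        # bits 9..8
    imm8 = word & 0xFF            # bits 7..0
    return opcode, rd, rs, imm8

print(decode(0b1010_01_10_00001111))  # (10, 1, 2, 15)
```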
149
+

150
+ ---
151
+
152
+ ## Verification
153
 
154
  The model includes `iron_eval.py` which exhaustively tests all circuits:
155
 
 
162
 
163
  | Category | Status | Notes |
164
  |----------|--------|-------|
165
+ | Boolean gates | Exhaustively tested | All 2^n input combinations |
166
+ | Arithmetic | Exhaustively tested | Full 8-bit range |
167
+ | ALU | Exhaustively tested | All operations, all inputs |
168
+ | Control flow | Exhaustively tested | Branch/jump conditions |
169
+ | Threshold | Exhaustively tested | k-of-n, majority, etc. |
170
  | Modular (mod 3,5,6,7,9,10,11,12) | Exhaustively tested | Multi-layer, hand-constructed |
171
  | Parity | Exhaustively tested | XOR tree, hand-constructed |
172
  | Modular (mod 2,4,8) | Exhaustively tested | Single-layer, trivial |
 
184
  ```
185
  {category}.{circuit}[.{layer}][.{component}].{weight|bias}
186
 
187
+ Examples:
188
+ boolean.and.weight
189
+ boolean.xor.layer1.neuron1.weight
190
+ arithmetic.ripplecarry8bit.fa7.ha2.sum.layer1.or.weight
191
+ modular.mod5.layer2.eq3.weight
192
+ error_detection.paritychecker8bit.stage2.xor1.layer1.nand.bias
193
+
194
+ Memory circuits are stored as packed tensors to keep the safetensors header size manageable
195
+ (e.g., `memory.addr_decode.weight`, `memory.read.and.weight`, `memory.write.and_old.weight`).
196
  ```
197
 
198
  ---
 
207
 
208
  ---
209
 
210
+ ## LLM Integration
211
+
212
+ The threshold circuits can be embedded into transformer MLP layers to give LLMs exact arithmetic capability.
213
+
214
+ ### Core Thesis
215
+
216
+ Standard LLMs fail at arithmetic because they are interpolators: they approximate functions over the training distribution rather than computing exact results. A 360M-parameter model trained on internet text has rarely, if ever, seen "127 + 128 = 255", so it guesses by pattern matching.
217
+
218
+ We solve this by embedding **frozen, proven-correct arithmetic circuits** directly into the transformer's MLP layers. The circuits use threshold logic (weighted sums + step activation), which is structurally compatible with neural network layers. We train only the **interface layers** that learn to:
219
+
220
+ 1. Extract operands from token embeddings
221
+ 2. Route computation through the circuits
222
+ 3. Inject results back into the residual stream
223
+
224
+ The model learns **call dispatch**, not arithmetic. The arithmetic is already solved.
225
+
226
+ ### Architecture
227
+
228
+ Standard MLP block with parallel circuit path:
229
+
230
+ ```
231
+ x ──┬── MLP path ────────────────┬── + ── output
232
+     │                            │
233
+     └── BitExtractor ── Circuit ─┴── BitInjector
234
+
235
+            Router (learned weighting)
236
+ ```
237
+
238
+ Augmented MLP forward pass:
239
+
240
+ ```python
241
+ def forward(self, x):  # x: [batch, seq, d_model]
242
+     # Original MLP path (unchanged)
243
+     mlp_out = self.down_proj(silu(self.gate_proj(x)) * self.up_proj(x))
244
+
245
+     # Circuit path (new)
246
+     a_bits, b_bits = self.bit_extractor(x)  # [batch, seq, 8] each
247
+     result_bits, carry = self.circuits.add_8bit(a_bits, b_bits)
248
+     flags = self.compute_flags(result_bits, carry)
249
+     circuit_delta = self.bit_injector(result_bits, flags)
250
+
251
+     # Routing
252
+     route_weights = self.router(x)  # [batch, seq, 2] softmax
253
+
254
+     # Combine
255
+     return mlp_out + route_weights[..., 1:2] * circuit_delta
256
+ ```
257
+
258
+ ### Threshold Logic Fundamentals
259
+
260
+ A threshold gate computes:
261
+
262
+ ```
263
+ output = 1 if (Σ wᵢxᵢ + b) ≥ 0
264
+ 0 otherwise
265
+ ```
266
+
267
+ Example gates:
268
+
269
+ ```
270
+ AND: w=[1,1], b=-2
271
+   AND(0,0) = H(-2) = 0
272
+   AND(1,1) = H(0) = 1
273
+
274
+ OR: w=[1,1], b=-1
275
+   OR(0,1) = H(0) = 1
276
+   OR(1,1) = H(1) = 1
277
+
278
+ XOR: requires 2 layers (not linearly separable)
279
+   Layer 1: OR + NAND
280
+   Layer 2: AND
281
+ ```
282
+
283
+ Full adder = 2 half-adders + carry OR, ~4 threshold layers.
284
+ 8-bit ripple carry = 8 chained full adders, ~32 threshold layers.
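The constructions above are easy to verify in plain Python (the NAND weights below are our own choice, consistent with the threshold rule; only AND and OR are given explicitly in this section):

```python
def H(x):
    """Heaviside step: 1 if x >= 0 else 0."""
    return 1 if x >= 0 else 0

def gate(w, b, xs):
    """A threshold gate: weighted sum plus bias, then step."""
    return H(sum(wi * xi for wi, xi in zip(w, xs)) + b)

AND = lambda a, b: gate([1, 1], -2, [a, b])
OR = lambda a, b: gate([1, 1], -1, [a, b])
NAND = lambda a, b: gate([-1, -1], 1, [a, b])   # assumed weights
XOR = lambda a, b: AND(OR(a, b), NAND(a, b))    # layer 1: OR + NAND; layer 2: AND

for a in (0, 1):
    for b in (0, 1):
        assert XOR(a, b) == a ^ b
```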
285
+
286
+ ### Interface Layers (Trainable)
287
+
288
+ **BitExtractor** — Maps embedding → two 8-bit operands:
289
+
290
+ ```python
291
+ class BitExtractor(nn.Module):
292
+     def __init__(self, d_model):
+         super().__init__()
293
+         self.proj = nn.Linear(d_model, 16)
294
+
295
+     def forward(self, x):
296
+         logits = self.proj(x)
297
+         bits = heaviside(logits)  # STE for training
298
+         return bits[..., :8], bits[..., 8:]
299
+ ```
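A quick shape check of the extractor sketch (assumes PyTorch; `d_model=960` as for SmolLM2-360M; a hard threshold stands in for the STE, as at inference):

```python
import torch
import torch.nn as nn

proj = nn.Linear(960, 16)              # the extractor's only learned layer
x = torch.randn(2, 5, 960)             # [batch, seq, d_model]
bits = (proj(x) >= 0).float()          # hard Heaviside at inference
a_bits, b_bits = bits[..., :8], bits[..., 8:]
assert a_bits.shape == (2, 5, 8) and b_bits.shape == (2, 5, 8)
assert set(bits.unique().tolist()) <= {0.0, 1.0}   # strictly binary operands
```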
300
+
301
+ **BitInjector** — Maps result bits → embedding delta:
302
+
303
+ ```python
304
+ class BitInjector(nn.Module):
305
+     def __init__(self, d_model):
+         super().__init__()
306
+         self.proj = nn.Linear(16, d_model)
307
+         self.scale = nn.Parameter(torch.tensor(0.1))
308
+
309
+     def forward(self, result_bits, flags):
310
+         combined = torch.cat([result_bits, flags], dim=-1)
311
+         return self.proj(combined) * self.scale
312
+ ```
313
+
314
+ **Router** — Decides when to use circuits:
315
+
316
+ ```python
317
+ class Router(nn.Module):
318
+     def __init__(self, d_model):
+         super().__init__()
319
+         self.net = nn.Sequential(
320
+             nn.Linear(d_model, 64), nn.ReLU(),
321
+             nn.Linear(64, 2), nn.Softmax(dim=-1),
322
+         )
+
+     def forward(self, x):
+         return self.net(x)
323
+ ```
324
+
325
+ ### Trainable Parameters
326
+
327
+ For SmolLM2-360M (d_model=960), augmenting 11 layers:
328
+
329
+ | Component | Params/Layer |
330
+ |-----------|-------------|
331
+ | BitExtractor | 15,376 |
332
+ | BitInjector | 16,321 |
333
+ | Router | 61,698 |
334
+ | OpSelector | ~31,000 |
335
+ | **Total** | ~124,395 |
336
+
337
+ **11 layers × 124,395 = ~1.37M trainable parameters** (0.38% of model)
338
+
339
+ ### Gradient Flow
340
+
341
+ Heaviside has zero gradient almost everywhere. We use **Straight-Through Estimator (STE)**:
342
+
343
+ ```python
344
+ class HeavisideSTE(torch.autograd.Function):
345
+     @staticmethod
346
+     def forward(ctx, x):
347
+         return (x >= 0).float()
348
+
349
+     @staticmethod
350
+     def backward(ctx, grad_output):
351
+         return grad_output  # pass through unchanged
352
+ ```
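A minimal check that the estimator really passes gradients through the flat step (assumes PyTorch):

```python
import torch

class HeavisideSTE(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        return (x >= 0).float()

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output  # identity gradient (straight-through)

x = torch.tensor([-0.5, 0.3], requires_grad=True)
HeavisideSTE.apply(x).sum().backward()
print(x.grad)  # gradient of 1.0 at every element, despite the flat step
```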
353
+
354
+ ### Training Strategy
355
+
356
+ 1. **Data**: Generate 8-bit arithmetic problems exhaustively (256×256 = 65,536 unique)
357
+ 2. **Loss**: Cross-entropy on answer tokens only (prompt masked with -100)
358
+ 3. **Optimizer**: AdamW on interface params only, lr=1e-4
359
+ 4. **Curriculum**: Single-digit → two-digit → full 8-bit → adversarial (127+128, 255+1)
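Step 1 can be sketched directly; the prompt format matches the training code's `"{a} + {b} ="` convention:

```python
def make_dataset():
    """All 65,536 8-bit additions, each exactly once; results wrap mod 256."""
    return [(f"{a} + {b} =", f" {(a + b) % 256}")
            for a in range(256) for b in range(256)]

data = make_dataset()
print(len(data))               # 65536
print(data[127 * 256 + 128])   # ('127 + 128 =', ' 255')
```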
360
+
361
+ ### Inference
362
+
363
+ At inference, Heaviside is a true step function, with no approximation. If the BitExtractor extracts the operands correctly, the circuit **will** output the correct result. The circuit path adds roughly 5-10% latency overhead.
364
+
365
+ ### Target Performance
366
+
367
+ | Model | Baseline | Target |
368
+ |-------|----------|--------|
369
+ | SmolLM2-360M | ~5-10% | >95% |
370
+
371
+ Once trained, the interface generalizes to **all** 65,536 8-bit additions: nothing is memorized; the circuits compute.
372
+
373
+ ### Extension Roadmap
374
+
375
+ - **Additional operations**: Subtraction, multiplication, division, bitwise ops, comparisons
376
+ - **Multi-operand**: "15 + 27 + 33 =" via accumulator pattern
377
+ - **Larger widths**: 16-bit, 32-bit via chained circuits
378
+ - **Symbolic integration**: Natural language problems → extract operands → compute → generate answer
379
+
380
+ ---
381
+
382
  ## Files
383
 
384
  | File | Description |
385
  |------|-------------|
386
+ | `neural_computer.safetensors` | 6,296 tensors, 8,267,667 parameters |
387
+ | `cpu/core.py` | CPU state, reference cycle, threshold runtime |
388
+ | `eval/iron_eval.py` | Comprehensive test suite |
389
 
390
  ---
391
 
 
409
 
410
  ---
411
 
412
+ ## References
413
 
414
+ 1. McCulloch & Pitts (1943). "A Logical Calculus of the Ideas Immanent in Nervous Activity"
415
+ 2. Muroga (1971). "Threshold Logic and Its Applications"
416
+ 3. Siegelmann & Sontag (1995). "On the Computational Power of Neural Nets"
417
+ 4. Bengio et al. (2013). "Estimating or Propagating Gradients Through Stochastic Neurons"
418
+ 5. Ma et al. (2024). "The Era of 1-bit LLMs" (BitNet b1.58)
419
+ 6. HuggingFace (2024). "SmolLM2: Small Language Models"
llm/core.py DELETED
@@ -1,766 +0,0 @@
1
- """
2
- Circuit-Augmented LLM: Embedding Threshold Logic Circuits into Transformers
3
- ============================================================================
4
-
5
- Embeds frozen, proven-correct arithmetic circuits into transformer MLP layers.
6
- The model learns call dispatch (when to use circuits), not arithmetic.
7
-
8
- ARCHITECTURE
9
- ------------
10
- Standard LLM MLPs are augmented with a parallel circuit path:
11
-
12
- x ──┬── MLP path ────────────────┬── + ── output
13
- │ │
14
- └── BitExtractor ── Circuit ─┴── BitInjector
15
-
16
- Router (learned weighting)
17
-
18
- THRESHOLD LOGIC
19
- ---------------
20
- Each gate: output = 1 if (Σ wᵢxᵢ + b) ≥ 0 else 0
21
-
22
- Examples:
23
- AND: w=[1,1], b=-2 → fires only when both inputs are 1
24
- OR: w=[1,1], b=-1 → fires when either input is 1
25
- XOR: 2-layer network (not linearly separable)
26
-
27
- Full adder = 2 half-adders + carry OR, ~4 threshold layers.
28
- 8-bit ripple carry = 8 chained full adders, ~32 threshold layers.
29
-
30
- TRAINING
31
- --------
32
- Only interface layers train (~1.37M params):
33
- - BitExtractor: embedding → operand bits
34
- - BitInjector: result bits → embedding delta
35
- - Router: when to use circuits vs MLP
36
-
37
- Circuits are frozen (proven correct via 6,590 exhaustive tests).
38
- Uses Straight-Through Estimator for Heaviside gradient flow.
39
-
40
- TARGET: SmolLM2-360M
41
- - 960 hidden dim, 32 layers, 361M params
42
- - Augment middle third (layers 10-20)
43
- - Baseline arithmetic: ~5-10%
44
- - Target: >95% (circuit-accurate)
45
-
46
- USAGE
47
- -----
48
- # Augment model
49
- model = augment_smollm2_with_circuits(model, "neural_computer.safetensors")
50
-
51
- # Train interface
52
- model = train_interface(model, tokenizer, n_epochs=3)
53
-
54
- # Evaluate
55
- results = evaluate_arithmetic(model, tokenizer, n_problems=100)
56
-
57
- REFERENCES
58
- ----------
59
- 1. McCulloch & Pitts (1943). Logical Calculus of Ideas in Nervous Activity
60
- 2. Muroga (1971). Threshold Logic and Its Applications
61
- 3. Bengio et al. (2013). Estimating Gradients Through Stochastic Neurons (STE)
62
- 4. Ma et al. (2024). The Era of 1-bit LLMs (BitNet)
63
- """
64
-
65
- from __future__ import annotations
66
-
67
- import argparse
68
- import warnings
69
- from typing import Dict, List, Optional, Tuple
70
-
71
- import torch
72
- import torch.nn as nn
73
- import torch.nn.functional as F
74
- from safetensors.torch import load_file
75
- from torch.utils.data import DataLoader, Dataset
76
- from tqdm import tqdm
77
- from transformers import AutoModelForCausalLM, AutoTokenizer
78
-
79
- warnings.filterwarnings("ignore")
80
-
81
-
82
- class HeavisideSTE(torch.autograd.Function):
83
- """Heaviside step function with straight-through estimator for backprop."""
84
-
85
- @staticmethod
86
- def forward(ctx, x):
87
- return (x >= 0).float()
88
-
89
- @staticmethod
90
- def backward(ctx, grad_output):
91
- return grad_output
92
-
93
-
94
- def heaviside(x: torch.Tensor) -> torch.Tensor:
95
- """Heaviside step: 1 if x >= 0, else 0. Uses STE for training."""
96
- return HeavisideSTE.apply(x)
97
-
98
-
99
- class CircuitExecutor(nn.Module):
100
- """
101
- Executes threshold logic circuits from safetensors.
102
- All circuit weights are frozen.
103
- """
104
-
105
- def __init__(self, circuit_path: str, device: str = "cpu"):
106
- super().__init__()
107
- self.device = device
108
-
109
- raw_circuits = load_file(circuit_path)
110
-
111
- self.circuits = {}
112
- for k, v in raw_circuits.items():
113
- safe_name = k.replace(".", "__")
114
- self.register_buffer(safe_name, v.float().to(device))
115
- self.circuits[k] = safe_name
116
-
117
- def _get(self, name: str) -> torch.Tensor:
118
- return getattr(self, self.circuits[name])
119
-
120
- def eval_and(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
121
- inp = torch.stack([a, b], dim=-1)
122
- w = self._get("boolean.and.weight")
123
- bias = self._get("boolean.and.bias")
124
- return heaviside(inp @ w + bias)
125
-
126
- def eval_or(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
127
- inp = torch.stack([a, b], dim=-1)
128
- w = self._get("boolean.or.weight")
129
- bias = self._get("boolean.or.bias")
130
- return heaviside(inp @ w + bias)
131
-
132
- def eval_xor(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
133
- inp = torch.stack([a, b], dim=-1)
134
-
135
- w1_n1 = self._get("boolean.xor.layer1.neuron1.weight")
136
- b1_n1 = self._get("boolean.xor.layer1.neuron1.bias")
137
- w1_n2 = self._get("boolean.xor.layer1.neuron2.weight")
138
- b1_n2 = self._get("boolean.xor.layer1.neuron2.bias")
139
-
140
- h1 = heaviside(inp @ w1_n1 + b1_n1)
141
- h2 = heaviside(inp @ w1_n2 + b1_n2)
142
- hidden = torch.stack([h1, h2], dim=-1)
143
-
144
- w2 = self._get("boolean.xor.layer2.weight")
145
- b2 = self._get("boolean.xor.layer2.bias")
146
-
147
- return heaviside(hidden @ w2 + b2)
148
-
149
- def eval_full_adder(
150
- self, a: torch.Tensor, b: torch.Tensor, cin: torch.Tensor, prefix: str
151
- ) -> Tuple[torch.Tensor, torch.Tensor]:
152
- inp_ab = torch.stack([a, b], dim=-1)
153
-
154
- w1_or = self._get(f"{prefix}.ha1.sum.layer1.or.weight")
155
- b1_or = self._get(f"{prefix}.ha1.sum.layer1.or.bias")
156
- w1_nand = self._get(f"{prefix}.ha1.sum.layer1.nand.weight")
157
- b1_nand = self._get(f"{prefix}.ha1.sum.layer1.nand.bias")
158
- w2 = self._get(f"{prefix}.ha1.sum.layer2.weight")
159
- b2 = self._get(f"{prefix}.ha1.sum.layer2.bias")
160
-
161
- h_or = heaviside(inp_ab @ w1_or + b1_or)
162
- h_nand = heaviside(inp_ab @ w1_nand + b1_nand)
163
- hidden = torch.stack([h_or, h_nand], dim=-1)
164
- ha1_sum = heaviside(hidden @ w2 + b2)
165
-
166
- w_c1 = self._get(f"{prefix}.ha1.carry.weight")
167
- b_c1 = self._get(f"{prefix}.ha1.carry.bias")
168
- ha1_carry = heaviside(inp_ab @ w_c1 + b_c1)
169
-
170
- inp_ha2 = torch.stack([ha1_sum, cin], dim=-1)
171
- w1_or = self._get(f"{prefix}.ha2.sum.layer1.or.weight")
172
- b1_or = self._get(f"{prefix}.ha2.sum.layer1.or.bias")
173
- w1_nand = self._get(f"{prefix}.ha2.sum.layer1.nand.weight")
174
- b1_nand = self._get(f"{prefix}.ha2.sum.layer1.nand.bias")
175
- w2 = self._get(f"{prefix}.ha2.sum.layer2.weight")
176
- b2 = self._get(f"{prefix}.ha2.sum.layer2.bias")
177
-
178
- h_or = heaviside(inp_ha2 @ w1_or + b1_or)
179
- h_nand = heaviside(inp_ha2 @ w1_nand + b1_nand)
180
- hidden = torch.stack([h_or, h_nand], dim=-1)
181
- ha2_sum = heaviside(hidden @ w2 + b2)
182
-
183
- w_c2 = self._get(f"{prefix}.ha2.carry.weight")
184
- b_c2 = self._get(f"{prefix}.ha2.carry.bias")
185
- ha2_carry = heaviside(inp_ha2 @ w_c2 + b_c2)
186
-
187
- inp_cout = torch.stack([ha1_carry, ha2_carry], dim=-1)
188
- w_or = self._get(f"{prefix}.carry_or.weight")
189
- b_or = self._get(f"{prefix}.carry_or.bias")
190
- cout = heaviside(inp_cout @ w_or + b_or)
191
-
192
- return ha2_sum, cout
193
-
194
- def add_8bit(
195
- self, a_bits: torch.Tensor, b_bits: torch.Tensor
196
- ) -> Tuple[torch.Tensor, torch.Tensor]:
197
- """
198
- 8-bit ripple carry addition.
199
- a_bits, b_bits: [..., 8] tensors (LSB first)
200
- Returns: (result_bits [..., 8], carry_out [...])
201
- """
202
- batch_shape = a_bits.shape[:-1]
203
- carry = torch.zeros(batch_shape, device=a_bits.device)
204
- result_bits = []
205
-
206
- for i in range(8):
207
- a_i = a_bits[..., i]
208
- b_i = b_bits[..., i]
209
- sum_bit, carry = self.eval_full_adder(
210
- a_i, b_i, carry, f"arithmetic.ripplecarry8bit.fa{i}"
211
- )
212
- result_bits.append(sum_bit)
213
-
214
- return torch.stack(result_bits, dim=-1), carry
215
-
216
- def greater_than_8bit(
217
- self, a_bits: torch.Tensor, b_bits: torch.Tensor
218
- ) -> torch.Tensor:
219
- diff = a_bits - b_bits
220
- w = self._get("arithmetic.greaterthan8bit.comparator")
221
- score = (diff * w).sum(dim=-1)
222
- return (score > 0).float()
223
-
224
- def less_than_8bit(
225
- self, a_bits: torch.Tensor, b_bits: torch.Tensor
226
- ) -> torch.Tensor:
227
- diff = b_bits - a_bits
228
- w = self._get("arithmetic.lessthan8bit.comparator")
229
- score = (diff * w).sum(dim=-1)
230
- return (score > 0).float()
231
-
232
- def equal_8bit(self, a_bits: torch.Tensor, b_bits: torch.Tensor) -> torch.Tensor:
233
- gt = self.greater_than_8bit(a_bits, b_bits)
234
- lt = self.less_than_8bit(a_bits, b_bits)
235
- return (1 - gt) * (1 - lt)
236
-
237
-
238
- class BitExtractor(nn.Module):
239
- """Maps embedding -> two 8-bit operands."""
240
-
241
- def __init__(self, d_model: int):
242
- super().__init__()
243
- self.d_model = d_model
244
- self.proj = nn.Linear(d_model, 16)
245
- self.temperature = nn.Parameter(torch.tensor(1.0))
246
-
247
- def forward(self, x: torch.Tensor) -> Tuple[torch.Tensor, torch.Tensor]:
248
- logits = self.proj(x)
249
- bits = heaviside(logits)
250
- a_bits = bits[..., :8]
251
- b_bits = bits[..., 8:]
252
- return a_bits, b_bits
253
-
254
-
255
- class BitInjector(nn.Module):
256
- """Maps result bits -> embedding delta."""
257
-
258
- def __init__(self, d_model: int):
259
- super().__init__()
260
- self.d_model = d_model
261
- self.proj = nn.Linear(16, d_model)
262
- self.scale = nn.Parameter(torch.tensor(0.1))
263
-
264
- def forward(self, result_bits: torch.Tensor, flags: torch.Tensor) -> torch.Tensor:
265
- combined = torch.cat([result_bits, flags], dim=-1)
266
- return self.proj(combined) * self.scale
267
-
268
-
269
- class CircuitAugmentedMLP(nn.Module):
270
- """
271
- MLP block augmented with frozen threshold circuits.
272
- Original MLP runs in parallel with circuit path; router decides weighting.
273
- """
274
-
275
- def __init__(
276
- self,
277
- d_model: int,
278
- intermediate_size: int,
279
- circuit_path: str,
280
- device: str = "cpu",
281
- ):
282
- super().__init__()
283
- self.d_model = d_model
284
-
285
- self.gate_proj = nn.Linear(d_model, intermediate_size, bias=False)
286
- self.up_proj = nn.Linear(d_model, intermediate_size, bias=False)
287
- self.down_proj = nn.Linear(intermediate_size, d_model, bias=False)
288
- self.act_fn = nn.SiLU()
289
-
290
- self.circuits = CircuitExecutor(circuit_path, device)
291
- self.bit_extractor = BitExtractor(d_model)
292
- self.bit_injector = BitInjector(d_model)
293
-
294
- self.router = nn.Sequential(
295
- nn.Linear(d_model, 64),
296
- nn.ReLU(),
297
- nn.Linear(64, 2),
298
- nn.Softmax(dim=-1),
299
- )
300
-
301
- self.op_selector = nn.Sequential(
302
- nn.Linear(d_model, 32),
303
- nn.ReLU(),
304
- nn.Linear(32, 4),
305
- nn.Softmax(dim=-1),
306
- )
307
-
308
- def _compute_flags(
309
- self, result_bits: torch.Tensor, carry: torch.Tensor
310
- ) -> torch.Tensor:
311
- batch_shape = result_bits.shape[:-1]
312
-
313
- zero = (result_bits.sum(dim=-1) == 0).float()
314
- negative = result_bits[..., 7]
315
- carry_flag = carry
316
-
317
- flags = torch.zeros(*batch_shape, 8, device=result_bits.device)
318
- flags[..., 0] = zero
319
- flags[..., 1] = negative
320
- flags[..., 2] = carry_flag
321
-
322
- return flags
323
-
324
- def _circuit_forward(self, x: torch.Tensor) -> torch.Tensor:
325
- a_bits, b_bits = self.bit_extractor(x)
326
- add_result, add_carry = self.circuits.add_8bit(a_bits, b_bits)
327
- add_flags = self._compute_flags(add_result, add_carry)
328
- circuit_delta = self.bit_injector(add_result, add_flags)
329
- return circuit_delta
330
-
331
- def forward(self, x: torch.Tensor) -> torch.Tensor:
332
- mlp_out = self.down_proj(self.act_fn(self.gate_proj(x)) * self.up_proj(x))
333
-
334
- circuit_out = self._circuit_forward(x)
335
-
336
- route_weights = self.router(x)
337
- circuit_weight = route_weights[..., 1:2]
338
-
339
- output = mlp_out + circuit_weight * circuit_out
340
-
341
- return output
342
-
343
-
344
- def augment_smollm2_with_circuits(
345
- model: AutoModelForCausalLM,
346
- circuit_path: str,
347
- layer_indices: list = None,
348
- device: str = "cpu",
349
- ) -> AutoModelForCausalLM:
350
- """
351
- Insert circuit blocks into SmolLM2's MLP layers.
352
-
353
- Args:
354
- model: Pretrained SmolLM2
355
- circuit_path: Path to neural_computer.safetensors
356
- layer_indices: Which layers to augment (default: middle third)
357
- device: Device for circuit tensors
358
-
359
- Returns:
360
- Model with circuit-augmented MLPs
361
- """
362
- config = model.config
363
- num_layers = config.num_hidden_layers
364
-
365
- if layer_indices is None:
366
- start = num_layers // 3
367
- end = 2 * num_layers // 3
368
- layer_indices = list(range(start, end))
369
-
370
- print(f"Augmenting layers {layer_indices} with threshold circuits...")
371
-
372
- for idx in layer_indices:
373
- layer = model.model.layers[idx]
374
- old_mlp = layer.mlp
375
-
376
- new_mlp = CircuitAugmentedMLP(
377
- d_model=config.hidden_size,
378
- intermediate_size=config.intermediate_size,
379
- circuit_path=circuit_path,
380
- device=device,
381
- )
382
-
383
- new_mlp.gate_proj.weight.data = old_mlp.gate_proj.weight.data.clone()
384
- new_mlp.up_proj.weight.data = old_mlp.up_proj.weight.data.clone()
385
- new_mlp.down_proj.weight.data = old_mlp.down_proj.weight.data.clone()
386
-
387
- layer.mlp = new_mlp
388
-
389
- for name, param in model.named_parameters():
390
- if "circuits" in name:
391
- param.requires_grad = False
392
-
393
- print("Done. Circuit weights frozen, interfaces trainable.")
394
-
395
- return model
396
-
397
-
398
- def generate_arithmetic_batch(
399
- batch_size: int, max_val: int = 255
400
- ) -> Tuple[list, list]:
401
- """Generate batch of arithmetic problems and solutions."""
402
- prompts = []
403
- targets = []
404
-
405
- for _ in range(batch_size):
406
- a = torch.randint(0, max_val + 1, (1,)).item()
407
- b = torch.randint(0, max_val + 1, (1,)).item()
408
- result = (a + b) % 256
409
-
410
- prompts.append(f"{a} + {b} =")
411
- targets.append(f" {result}")
412
-
413
- return prompts, targets
414
-
415
-
416
- def evaluate_arithmetic(
417
- model: AutoModelForCausalLM,
418
- tokenizer: AutoTokenizer,
419
- n_problems: int = 100,
420
- device: str = "cpu",
421
- ) -> dict:
422
- """Evaluate model on random arithmetic problems."""
423
- correct = 0
424
- total = 0
425
- errors = []
426
-
427
- model.eval()
428
-
429
- for _ in range(n_problems):
430
- a = torch.randint(0, 256, (1,)).item()
431
- b = torch.randint(0, 256, (1,)).item()
432
- expected = (a + b) % 256
433
-
434
- prompt = f"{a} + {b} ="
435
- inputs = tokenizer(prompt, return_tensors="pt").to(device)
436
-
437
- with torch.no_grad():
438
- outputs = model.generate(
439
- **inputs,
440
- max_new_tokens=10,
441
- do_sample=False,
442
- pad_token_id=tokenizer.eos_token_id,
443
- )
444
-
445
- response = tokenizer.decode(outputs[0], skip_special_tokens=True)
446
-
447
- try:
448
- answer_part = response.split("=")[-1].strip()
449
- predicted = int("".join(c for c in answer_part.split()[0] if c.isdigit()))
450
-
451
- if predicted == expected:
452
- correct += 1
453
- else:
454
- errors.append((a, b, expected, predicted))
455
- except:
456
- errors.append((a, b, expected, "parse_error"))
457
-
458
- total += 1
459
-
460
- return {
461
- "accuracy": correct / total,
462
- "correct": correct,
463
- "total": total,
464
- "errors": errors[:10],
465
- }
466
-
467
-
-class ArithmeticDataset(Dataset):
-    """Dataset of 8-bit addition problems."""
-
-    def __init__(self, tokenizer, n_samples: int = 10000, max_val: int = 255):
-        self.tokenizer = tokenizer
-        self.n_samples = n_samples
-        self.max_val = max_val
-
-        self.examples = []
-        for _ in range(n_samples):
-            a = torch.randint(0, max_val + 1, (1,)).item()
-            b = torch.randint(0, max_val + 1, (1,)).item()
-            result = (a + b) % 256
-
-            prompt = f"{a} + {b} ="
-            target = f" {result}"
-
-            self.examples.append((prompt, target, a, b, result))
-
-    def __len__(self):
-        return len(self.examples)
-
-    def __getitem__(self, idx):
-        prompt, target, a, b, result = self.examples[idx]
-
-        prompt_ids = self.tokenizer.encode(prompt, add_special_tokens=False)
-        target_ids = self.tokenizer.encode(target, add_special_tokens=False)
-
-        input_ids = prompt_ids + target_ids
-        labels = [-100] * len(prompt_ids) + target_ids
-
-        return {
-            "input_ids": torch.tensor(input_ids),
-            "labels": torch.tensor(labels),
-            "a": a,
-            "b": b,
-            "result": result,
-        }
-
-
-def collate_fn(batch):
-    """Collate with padding."""
-    max_len = max(len(item["input_ids"]) for item in batch)
-
-    input_ids = []
-    labels = []
-    attention_mask = []
-
-    for item in batch:
-        pad_len = max_len - len(item["input_ids"])
-
-        input_ids.append(
-            torch.cat([item["input_ids"], torch.zeros(pad_len, dtype=torch.long)])
-        )
-        labels.append(
-            torch.cat(
-                [item["labels"], torch.full((pad_len,), -100, dtype=torch.long)]
-            )
-        )
-        attention_mask.append(
-            torch.cat([torch.ones(len(item["input_ids"])), torch.zeros(pad_len)])
-        )
-
-    return {
-        "input_ids": torch.stack(input_ids),
-        "labels": torch.stack(labels),
-        "attention_mask": torch.stack(attention_mask),
-    }
-
-
-def train_interface(
-    model: AutoModelForCausalLM,
-    tokenizer: AutoTokenizer,
-    n_epochs: int = 3,
-    batch_size: int = 16,
-    lr: float = 1e-4,
-    n_train_samples: int = 10000,
-    device: str = "cpu",
-    eval_every: int = 500,
-):
-    """
-    Train the circuit interface layers.
-
-    Only trains:
-    - bit_extractor (embedding -> bits)
-    - bit_injector (bits -> embedding)
-    - router (circuit vs MLP weighting)
-    - op_selector (which operation)
-    """
-    print("\n" + "=" * 70)
-    print(" TRAINING CIRCUIT INTERFACE")
-    print("=" * 70)
-
-    interface_params = []
-    frozen_count = 0
-    trainable_count = 0
-
-    for name, param in model.named_parameters():
-        if any(
-            x in name for x in ["bit_extractor", "bit_injector", "router", "op_selector"]
-        ):
-            param.requires_grad = True
-            interface_params.append(param)
-            trainable_count += param.numel()
-        else:
-            param.requires_grad = False
-            frozen_count += param.numel()
-
-    print(f"\n Frozen parameters: {frozen_count:,}")
-    print(f" Trainable parameters: {trainable_count:,}")
-    print(f" Training {len(interface_params)} parameter groups")
-
-    print(f"\n Creating dataset ({n_train_samples} examples)...")
-    dataset = ArithmeticDataset(tokenizer, n_samples=n_train_samples)
-    dataloader = DataLoader(
-        dataset, batch_size=batch_size, shuffle=True, collate_fn=collate_fn
-    )
-
-    optimizer = torch.optim.AdamW(interface_params, lr=lr)
-
-    model.to(device)
-    model.train()
-
-    global_step = 0
-    total_loss = 0
-
-    for epoch in range(n_epochs):
-        print(f"\n Epoch {epoch + 1}/{n_epochs}")
-        print(" " + "-" * 60)
-
-        epoch_loss = 0
-        epoch_steps = 0
-
-        pbar = tqdm(dataloader, desc=" Training", leave=False)
-
-        for batch in pbar:
-            input_ids = batch["input_ids"].to(device)
-            labels = batch["labels"].to(device)
-            attention_mask = batch["attention_mask"].to(device)
-
-            outputs = model(
-                input_ids=input_ids, attention_mask=attention_mask, labels=labels
-            )
-
-            loss = outputs.loss
-
-            optimizer.zero_grad()
-            loss.backward()
-            optimizer.step()
-
-            epoch_loss += loss.item()
-            epoch_steps += 1
-            global_step += 1
-            total_loss += loss.item()
-
-            pbar.set_postfix({"loss": f"{loss.item():.4f}"})
-
-            if global_step % eval_every == 0:
-                model.eval()
-                eval_results = evaluate_arithmetic(
-                    model, tokenizer, n_problems=50, device=device
-                )
-                print(
-                    f"\n Step {global_step}: Loss={total_loss/eval_every:.4f}, "
-                    f"Accuracy={eval_results['accuracy']*100:.1f}%"
-                )
-                total_loss = 0
-                model.train()
-
-        avg_loss = epoch_loss / epoch_steps
-        print(f"\n Epoch {epoch + 1} complete. Avg loss: {avg_loss:.4f}")
-
-        model.eval()
-        eval_results = evaluate_arithmetic(
-            model, tokenizer, n_problems=100, device=device
-        )
-        print(
-            f" Evaluation: {eval_results['accuracy']*100:.1f}% "
-            f"({eval_results['correct']}/{eval_results['total']})"
-        )
-
-        if eval_results["errors"]:
-            print(" Sample errors:")
-            for a, b, exp, got in eval_results["errors"][:3]:
-                print(f" {a} + {b} = {exp}, model said {got}")
-
-        model.train()
-
-    print("\n" + "=" * 70)
-    print(" TRAINING COMPLETE")
-    print("=" * 70)
-
-    return model
-
-
-if __name__ == "__main__":
-    parser = argparse.ArgumentParser(description="Circuit-Augmented LLM")
-    parser.add_argument(
-        "--circuit-path",
-        type=str,
-        default="./neural_computer.safetensors",
-        help="Path to circuit weights",
-    )
-    parser.add_argument("--device", type=str, default="cpu", help="Device")
-    parser.add_argument("--epochs", type=int, default=3, help="Number of epochs")
-    parser.add_argument("--batch-size", type=int, default=8, help="Batch size")
-    parser.add_argument("--lr", type=float, default=1e-4, help="Learning rate")
-    parser.add_argument(
-        "--n-samples", type=int, default=5000, help="Training samples"
-    )
-    parser.add_argument(
-        "--eval-only", action="store_true", help="Only evaluate baseline"
-    )
-    args = parser.parse_args()
-
-    print("=" * 70)
-    print(" CIRCUIT-AUGMENTED LLM")
-    print("=" * 70)
-
-    print("\n[1] Loading SmolLM2-360M...")
-    model_id = "HuggingFaceTB/SmolLM2-360M"
-    tokenizer = AutoTokenizer.from_pretrained(model_id)
-    tokenizer.pad_token = tokenizer.eos_token
-    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)
-
-    print(f" Parameters: {sum(p.numel() for p in model.parameters()):,}")
-
-    print("\n[2] Baseline arithmetic evaluation...")
-    baseline = evaluate_arithmetic(model, tokenizer, n_problems=50, device=args.device)
-    print(
-        f" Accuracy: {baseline['accuracy']*100:.1f}% "
-        f"({baseline['correct']}/{baseline['total']})"
-    )
-    if baseline["errors"]:
-        print(" Sample errors:")
-        for a, b, exp, got in baseline["errors"][:5]:
-            print(f" {a} + {b} = {exp}, model said {got}")
-
-    if args.eval_only:
-        print("\nDone (eval only mode).")
-        exit(0)
-
-    print(f"\n[3] Augmenting with threshold circuits...")
-    print(f" Circuit path: {args.circuit_path}")
-    model = augment_smollm2_with_circuits(model, args.circuit_path, device=args.device)
-
-    new_params = sum(p.numel() for p in model.parameters())
-    trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
-    print(f" Total parameters: {new_params:,}")
-    print(f" Trainable parameters: {trainable:,}")
-
-    print("\n[4] Testing circuit execution...")
-    circuit_exec = CircuitExecutor(args.circuit_path, args.device)
-
-    test_cases = [(127, 128), (255, 1), (0, 0), (100, 55)]
-    for a, b in test_cases:
-        a_bits = torch.tensor([(a >> i) & 1 for i in range(8)], dtype=torch.float32)
-        b_bits = torch.tensor([(b >> i) & 1 for i in range(8)], dtype=torch.float32)
-
-        result_bits, carry = circuit_exec.add_8bit(
-            a_bits.unsqueeze(0), b_bits.unsqueeze(0)
-        )
-
-        result = sum(int(result_bits[0, i].item()) * (2**i) for i in range(8))
-        expected = (a + b) % 256
-
-        status = "OK" if result == expected else "FAIL"
-        print(f" {a} + {b} = {result} (expected {expected}) [{status}]")
-
-    print("\n[5] Training interface layers...")
-    model = train_interface(
-        model,
-        tokenizer,
-        n_epochs=args.epochs,
-        batch_size=args.batch_size,
-        lr=args.lr,
-        n_train_samples=args.n_samples,
-        device=args.device,
-    )
-
-    print("\n[6] Final evaluation...")
-    final = evaluate_arithmetic(model, tokenizer, n_problems=100, device=args.device)
-    print(f" Final accuracy: {final['accuracy']*100:.1f}%")
-    print(
-        f" Improvement: {baseline['accuracy']*100:.1f}% -> {final['accuracy']*100:.1f}%"
-    )
-
-    save_path = "./circuit_augmented_smollm2.pt"
-    print(f"\n[7] Saving to {save_path}...")
-    torch.save(
-        {
-            "model_state_dict": model.state_dict(),
-            "baseline_accuracy": baseline["accuracy"],
-            "final_accuracy": final["accuracy"],
-        },
-        save_path,
-    )
-
-    print("\nDone!")
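
The deleted script packs integers into LSB-first bit vectors via `(a >> i) & 1` before calling the circuit adder, and unpacks results with `sum(bit_i * 2**i)`. A minimal pure-Python sketch of that convention (the helper names `to_bits`/`from_bits` are hypothetical, not from the repo):

```python
def to_bits(value: int, width: int = 8) -> list[int]:
    """LSB-first bit vector, matching the (value >> i) & 1 packing in core.py."""
    return [(value >> i) & 1 for i in range(width)]

def from_bits(bits: list[int]) -> int:
    """Inverse unpacking: sum of bit_i * 2**i."""
    return sum(b * (1 << i) for i, b in enumerate(bits))

def add_mod_256(a: int, b: int) -> int:
    """Wrap-around 8-bit addition, the reference the circuit is checked against."""
    return from_bits(to_bits((a + b) % 256))

assert to_bits(5) == [1, 0, 1, 0, 0, 0, 0, 0]
assert from_bits(to_bits(200)) == 200
assert add_mod_256(255, 1) == 0
```

These are the same cases the script's `[4] Testing circuit execution` step exercises (e.g. `255 + 1 = 0` wraps modulo 256); converting such a vector to a `torch.float32` tensor reproduces the exact inputs fed to `circuit_exec.add_8bit`.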