celiumsAI commited on
Commit
6e0b7c1
·
verified ·
1 Parent(s): 3d57ce4

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -84,3 +84,18 @@ paper for the full method and the honest negatives.
84
 
85
  Code: **GPL-3.0**. Paper & docs: **CC-BY-SA-4.0**. The frozen base model (Gemma 4) is subject to Google's
86
  Gemma terms; this work distributes the **channel adapter and method**, not Gemma's weights.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
84
 
85
  Code: **GPL-3.0**. Paper & docs: **CC-BY-SA-4.0**. The frozen base model (Gemma 4) is subject to Google's
86
  Gemma terms; this work distributes the **channel adapter and method**, not Gemma's weights.
87
+
88
+ ## What's in this repo
89
+
90
+ | file | what |
91
+ |---|---|
92
+ | `adapter_model.safetensors` | the trained channel adapter — **185.8M params, bf16** (the integrated 6/6 checkpoint, step 10000) |
93
+ | `adapter_config.json` | dims, channel sizes, K-per-channel, base model |
94
+ | `modeling_channels.py` | the `ChannelInjectionDelta` + `ChanneledLayer` modules (post-layer gated cross-attention + ReZero) |
95
+ | `proprioceptive-channels.pdf` | the paper |
96
+
97
+ This is the **channel adapter only** — not Gemma's weights. It wraps a **frozen `google/gemma-4-E2B-it`**
98
+ (35 text layers, hidden 1536); load Gemma from Google, then wrap each layer with `ChanneledLayer` and load
99
+ these weights. With ReZero α at its trained values the channels drive behavior; with α=0 the model is
100
+ bit-exact to vanilla Gemma. See `modeling_channels.py` and the paper for the wiring and the six channel
101
+ dimensions (memory 1024 · affect 2 · time 16 · ethics 24 · identity 1024 · continuity 1024).