SynapNet — Episodic Memory Checkpoints
Hybrid SSM + sparse-attention + episodic-memory architecture. This repo holds the base architectural checkpoints; the edge-deployment variants live at Vineetha00/synapnet-edge.
📦 Code: https://github.com/vineetha00/SynapNet_Exp 🛠️ Deployment companion: https://github.com/vineetha00/SynapNet-Edge · 🤗 https://huggingface.co/Vineetha00/synapnet-edge
Checkpoints in this repo
| File | Variant | Size | Notes |
|---|---|---|---|
synapnet_memory_v4.pt |
Track 3 v4 (current best) | 5.7 MB | Trained with salience supervision λ=0.01; near-perfect episodic recall |
synapnet_memory_v2.pt |
Track 3 v2 | 6.8 MB | Earlier episodic-recall checkpoint |
synapnet_memory.pt |
Original episodic memory | 18.2 MB | Initial release of the WriteableMemory variant |
Architecture (Track 3 v4 — recommended)
SimpleSSM(depthwise conv, kernel_size=9) for local temporal dynamicsSparseEventAttention(salience-gated top-K mixing)WriteableMemory(top-K hidden states written to a fixed-size bank, read via cross-attention)- Gated fusion (α, β) over the three pathways
- 4 stacked blocks, dim=128, heads=4
Key finding (λ-sweep, 5 seeds, ctx=2048, 32-class recall)
A small dose of salience supervision flips episodic recall from chance to near-perfect:
| λ (salience-supervision strength) | Accuracy | Write hit-rate (target token written?) |
|---|---|---|
| 0.00 | 0.723 ± 0.329 | 0.125 ± 0.109 |
| 0.01 | 0.968 ± 0.022 | 0.993 ± 0.002 |
| 0.10 | 0.970 ± 0.021 | 0.995 ± 0.001 |
| 1.00 | 0.838 ± 0.125 | 0.995 ± 0.002 |
The v4 checkpoint was trained at the sweet-spot λ=0.01.
Loading
import torch
from huggingface_hub import hf_hub_download
# Make sure SynapNet_Exp is on your path:
# git clone https://github.com/vineetha00/SynapNet_Exp
# cd SynapNet_Exp
import sys; sys.path.insert(0, "/path/to/SynapNet_Exp")
from synapnet_memory import SynapEpisodicNet
ckpt_path = hf_hub_download(
repo_id="Vineetha00/synapnet",
filename="synapnet_memory_v4.pt",
)
state = torch.load(ckpt_path, map_location="cpu")
model = SynapEpisodicNet(
dim=128, depth=4, vocab_size=2048,
max_len=2048, num_classes=32, heads=4,
)
model.load_state_dict(state)
model.eval()
Training tasks (from the companion code)
- Track 1 — LRA-style long-range classification (
train_track1_lra.py) - Track 2 — biosignal regression / ECG reconstruction (
train_track2_biosignal.py) - Track 3 — episodic recall (
train_track3_memory_v2/v3/v4.py) - Track 4 — faithful-salience supervision for interpretability (
train_track4_interpret.py)
License
MIT — see LICENSE.
Citation
@article{synapnet_2026,
title={SynapNet: Hybrid SSM + Sparse-Attention + Episodic Memory for Long-Range Sequence Modelling},
author={Vallish Kumar, Vineetha},
year={2026},
}
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support