memory-augmented-generation

Sleeping

App Files Files Community

memory-augmented-generation / README.md

Pavantej

Upload folder using huggingface_hub

fbca19f verified 5 months ago

preview code

raw

history blame contribute delete

2.26 kB

A newer version of the Gradio SDK is available: 6.16.0

Upgrade

metadata

title: Titans Miras Demo
emoji: 🔬
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.36.1
app_file: app.py
pinned: false

Titans + MIRAS: A Brain That Changes Itself While Thinking

A minimal but faithful reimplementation of Titans (test-time learning) and MIRAS (associative memory framework) using open-source models on Hugging Face.

What is this?

This demo showcases a neural architecture that can learn and update its memory while generating responses - a brain that literally changes itself while thinking!

Key Features

🔄 Test-time learning: Memory updates during inference (not just training)
🎯 Retention gate: Surprising/novel inputs are more memorable (inspired by human memory)
💾 Persistent memory: State is saved across sessions
🤖 Fully OSS: Uses distilgpt2 and runs entirely on Hugging Face

Architecture

User Input
    ↓
[Base LM: distilgpt2] → Hidden States (768-dim)
    ↓
[Key/Value Projections] → Memory Space (256-dim)
    ↓
[MIRAS Memory Module] ← Test-time Gradient Updates
    ↓
[Text Generation] → Response + Memory Stats

Components

Base Language Model: distilgpt2 (frozen, no training)
Projection Layers: Map hidden states to memory space
MIRAS Memory: Associative memory with learnable key→value mapping
Retention Gate: Adjusts learning rate based on surprise (loss magnitude)
Memory Store: Persists memory state to disk

How It Works

Input text is processed through distilgpt2
Last hidden state is projected to key/value pairs
Memory predicts value from key
Loss (prediction error) indicates surprise
Higher surprise → higher retention → faster learning
Memory updated via gradient descent (1e-3 base LR)
Response generated and memory saved

References

Titans: Learning to Memorize at Test Time
MIRAS: Framework for Associative Memory with Attentional Bias

Running Locally

pip install -r requirements.txt
python app.py

Built with ❤️ exploring the future of adaptive AI systems.