MemForest: An Efficient Agent Memory System with Hierarchical Temporal Indexing
Abstract
MemForest presents a memory framework for long-context LLM agents that improves scalability and reduces latency through parallel chunk extraction and hierarchical temporal indexing.
Memory is a fundamental component for enabling long-context LLM agents, supporting persistent state across interactions through a continuous serve-and-update lifecycle. Despite substantial prior work, existing systems suffer from significant maintenance overhead due to two key limitations: coarse-grained state management and inherently sequential update pipelines. In particular, updates are often tightly coupled with LLM inference and require full-state rewrites, leading to poor scalability and growing latency as memory accumulates. To address these challenges, we present MemForest, a memory framework that reformulates agent memory as a write-efficient temporal data management problem. MemForest breaks the sequential bottleneck via parallel chunk extraction, decoupling memory construction into concurrent, independent operations. To further eliminate coarse-grained maintenance, we introduce MemTree, a hierarchical temporal index that organizes memory as time-ordered trees rather than flat global summaries. This design replaces full-state rewrites with localized per-node updates, reducing maintenance cost to the affected tree paths while naturally preserving temporally evolving states. We evaluate MemForest on two long-context memory benchmarks, LongMemEval-S and LoCoMo. On LongMemEval-S, MemForest achieves the best overall performance among stateful baselines, reaching 79.8% pass@1 accuracy while sustaining a memory construction throughput approximately 6x higher than state-of-the-art approaches including EverMemOS.
Community
A Latency optimized parallel write Agent Memory System.
the most interesting bit for me is the MemTree idea, a time-ordered hierarchical index that localizes maintenance to the touched paths instead of rewriting the whole memory. that per-node update pattern plus the lazy refresh of interval summaries and root rows makes the write path truly parallel and chunk-driven. it also clarifies retrieval: you go from root-based recall to interval-summary guided tree browse, which preserves temporal fidelity without brute-force rewrites. btw, the arxivlens breakdown helped me parse the method details, especially how the session/entity/scene scoped trees interplay. one practical question: how does MemForest fare when memory content becomes highly non-stationary, would dynamic re-scoping keep freshness high without blowing up maintenance?
Get this paper in your agent:
hf papers read 2605.23986 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper