arxiv:2605.23986

MemForest: An Efficient Agent Memory System with Hierarchical Temporal Indexing

Published on May 16

· Submitted by

CHEN Han on May 26

Upvote

Authors:

Han Chen ,

Abstract

MemForest presents a memory framework for long-context LLM agents that improves scalability and reduces latency through parallel chunk extraction and hierarchical temporal indexing.

AI-generated summary

Memory is a fundamental component for enabling long-context LLM agents, supporting persistent state across interactions through a continuous serve-and-update lifecycle. Despite substantial prior work, existing systems suffer from significant maintenance overhead due to two key limitations: coarse-grained state management and inherently sequential update pipelines. In particular, updates are often tightly coupled with LLM inference and require full-state rewrites, leading to poor scalability and growing latency as memory accumulates. To address these challenges, we present MemForest, a memory framework that reformulates agent memory as a write-efficient temporal data management problem. MemForest breaks the sequential bottleneck via parallel chunk extraction, decoupling memory construction into concurrent, independent operations. To further eliminate coarse-grained maintenance, we introduce MemTree, a hierarchical temporal index that organizes memory as time-ordered trees rather than flat global summaries. This design replaces full-state rewrites with localized per-node updates, reducing maintenance cost to the affected tree paths while naturally preserving temporally evolving states. We evaluate MemForest on two long-context memory benchmarks, LongMemEval-S and LoCoMo. On LongMemEval-S, MemForest achieves the best overall performance among stateful baselines, reaching 79.8% pass@1 accuracy while sustaining a memory construction throughput approximately 6x higher than state-of-the-art approaches including EverMemOS.

View arXiv page View PDF GitHub 14 Add to collection

Community

Concyclics

Paper author Paper submitter about 18 hours ago

A Latency optimized parallel write Agent Memory System.

avahal

about 4 hours ago

the most interesting bit for me is the MemTree idea, a time-ordered hierarchical index that localizes maintenance to the touched paths instead of rewriting the whole memory. that per-node update pattern plus the lazy refresh of interval summaries and root rows makes the write path truly parallel and chunk-driven. it also clarifies retrieval: you go from root-based recall to interval-summary guided tree browse, which preserves temporal fidelity without brute-force rewrites. btw, the arxivlens breakdown helped me parse the method details, especially how the session/entity/scene scoped trees interplay. one practical question: how does MemForest fare when memory content becomes highly non-stationary, would dynamic re-scoping keep freshness high without blowing up maintenance?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2605.23986

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2605.23986 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2605.23986 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2605.23986 in a Space README.md to link it from this page.