Can agents reconstruct memory on demand instead of retrieving it?
Explores whether interleaving reasoning with memory traversal during retrieval beats the standard approach of fetching memories first then reasoning over them. Matters because it could reduce wasted token cost and improve agent adaptability.
Most memory-augmented agents run a rigid pipeline: retrieve a fixed set of memories by similarity, then reason over them. MRAgent's claim is that this ordering is the bug. Because the retrieval step is committed before the model has seen any intermediate evidence, the agent cannot adapt what it looks up based on what it discovers mid-inference. The fix is to interleave reasoning directly into memory access over an associative Cue–Tag–Content graph, so retrieval becomes an active, multi-step reconstruction — the agent iteratively explores and prunes traversal paths conditioned on accumulated evidence, which yields up to 23% gains on LOCOMO and LONG-MEMEVAL while cutting token and runtime cost.
The titular framing — memory is reconstructed, not retrieved — is a genuine reframing borrowed from human cognition, where recall is a constructive act rather than a lookup. The deeper architectural move is deferring relational reasoning to the retrieval stage: instead of pre-computing all relational structure into the graph (the knowledge-graph instinct), MRAgent keeps construction simple and resolves complex dependencies on demand through targeted, state-dependent exploration. This is the traversal-side complement to Should agent memory adapt dynamically based on execution feedback? and to Is agent memory a storage problem or a connectivity problem? — both locate memory's value in dynamic access over a connected structure rather than in the store itself. It also operationalizes Can agents fail from weak memory control rather than missing knowledge?: the failure was control over access, and active reconstruction is that control.
The cost is the mirror image of the benefit, and the paper is candid about it. Because relational reasoning is deferred to retrieval, reconstruction cost grows with exploration depth — queries needing many traversal hops incur higher latency than single-shot retrieval. So the win is not unconditional: active reconstruction pays off when relational structure is sparse and query-specific, but a query that must traverse deeply can cost more than just dumping a large retrieved context would have. The pruning mechanism is doing the real work of keeping this from exploding combinatorially.
Inquiring lines that use this note as a source 15
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- How do memory hygiene and context efficiency trade off in deployed agents?
- Why do agents ignore condensed experience in favor of raw data?
- How does indiscriminate memory injection cause multi-turn agent failures?
- When does forcing agent reasoning into code become a leaky abstraction?
- Why does consolidating more state sometimes hurt performance below the no-memory baseline?
- How should future memory systems control what gets written and trusted?
- Why does memory effectiveness depend on connectivity rather than storage volume?
- What causes multi-turn agent failures: weak memory control or missing knowledge?
- When does active reconstruction cost more than simple context dumping?
- How should agent memory links evolve based on execution feedback?
- Does effective feedback compute matter more than raw token expenditure for agent scaling?
- Why is long-context compute spent transforming context into internal state rather than storing it?
- Why do persistent AI systems require fundamentally different design than ad-hoc supporters?
- Does selective history retrieval outperform full context inclusion in agent reasoning?
- What discarding policy prevents both stale entries and loss of rare critical knowledge?
Related concepts in this collection 3
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Should agent memory adapt dynamically based on execution feedback?
Can agents improve performance by continuously reshaping memory connections in response to whether tasks succeed or fail, rather than relying on fixed retrieval pipelines? This matters because static memory degrades in changing environments.
convergent-with: both reject fixed pipelines for dynamic, feedback-driven memory access; MRAgent prunes at traversal time
-
Is agent memory a storage problem or a connectivity problem?
Most systems treat memory as a repository to store and retrieve. But what if memory's real usefulness depends on how units are linked together rather than what is stored?
grounds: explains why a traversable associative graph beats a similarity store
-
Can agents fail from weak memory control rather than missing knowledge?
As multi-turn agent workflows grow longer, performance degrades—but is this due to insufficient context or poor memory management? This explores whether memory *control* is the real bottleneck.
extends: active reconstruction is a concrete mechanism for the memory control these failures lack
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents
- ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
- Toward Efficient Agents: A Survey of Memory, Tool Learning, and Planning
- Useful Memories Become Faulty When Continuously Updated by LLMs
- Single-Agent LLMs Outperform Multi-Agent Systems on Multi-Hop Reasoning Under Equal Thinking Token Budgets
- The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
- Eliciting Reasoning in Language Models with Cognitive Tools
- DeepAgent: A General Reasoning Agent with Scalable Toolsets
Original note title
moving relational reasoning from storage into retrieval lets agent memory be reconstructed on demand rather than retrieved — reasoning interleaved with graph traversal beats retrieve-then-reason