Can cognition work by reusing memory instead of recomputing?
Does intelligence emerge from structured navigation of prior inference paths rather than fresh computation? This challenges whether brains and AI systems need to recalculate constantly or can leverage stored trajectories for efficiency.
Memory-Amortized Inference (MAI) proposes that intelligence is fundamentally non-ergodic: it emerges from structured reuse of prior inference trajectories, not from uniform sampling or optimization from scratch. This is a sharp departure from standard computational models where each inference begins fresh.
The core framework: cognition is modeled as inference over latent cycles in memory. Memory trajectories define topologically stable, entropy-minimizing paths through representational space. The system navigates over constrained latent manifolds guided by persistent topological memory — enabling context-aware, structure-preserving inference with dramatically reduced computational cost.
The most provocative claim is the time-reversal duality between MAI and RL: whereas RL propagates value forward from reward (bootstrapping over futures), MAI reconstructs latent causes backward from memory structures (inferring the past from its traces). Both rely on partial, structure-aware updates to minimize uncertainty. This duality allows MAI to invert RL's reward-driven flow, replacing energy-intensive iteration with structure-aware reuse.
Practical implications:
- Energy efficiency: MAI addresses the computational bottleneck of modern AI by replacing brute-force recomputation with memory-based navigation
- Biological plausibility: Models each cortical column as a local inference operator over cycle-consistent memory states, providing a principled account of Mountcastle's Universal Cortical Algorithm
- Delta-homology: The mathematical framework uses topological tools to characterize the stability of memory cycles — inference is stable when memory trajectories form homologically persistent structures
This is highly theoretical and requires empirical validation. But the conceptual contribution is clear: if RL is "learning what to do from future rewards," MAI is "learning what happened from past memories." The suggestion that these are formally dual operations connecting cognition to decision-making opens a new theoretical perspective on the relationship between memory and agency.
Inquiring lines that use this note as a source 26
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- How do the six memory components combine across explicit and implicit paths?
- What does Wang mean by intelligence as adaptation with limited resources?
- How do humans use associative reasoning without causal connections?
- Does psychological continuity require uninterrupted consciousness or restored context?
- How do biological brains organize computation across different cortical timescales?
- Are traditional cognitive theories missing interaction effects between mechanisms?
- What specific cognitive failure prevents AI from detecting frame activation?
- How do cortical columns implement local inference over memory cycles?
- What makes memory trajectories topologically stable under persistent reuse?
- How do retrieved memories differ from decision-context passages for prediction?
- How do insert, forget, and merge operations maintain thought coherence over time?
- Does functional integration determine cognitive system boundaries?
- How does trajectory burstiness compare to other structural properties that shape emergent capabilities?
- Why is metacognition neglected as a foundational AI research area?
- What makes a memory reachable in the right context?
- How does co-activation shape which memories become linked together?
- Why do successful and failed trajectories need different memory processing?
- What computational costs does closed-loop memory refinement introduce?
- What distinguishes formation, evolution, and retrieval as separate memory dynamics?
- Why does the hot-path cold-path split map onto formation and evolution?
- Can deterministic computation actually create new information in data?
- Can offline recurrent passes replicate sleep-based memory consolidation in AI?
- How does the hippocampus bind disparate elements without storing everything itself?
- What gets lost when we describe memory as retrieval?
- How does continuous implicit memory formation differ from explicit memory encoding?
- How does treating cognition as computation reshape education and work?
Related concepts in this collection 3
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
When should AI systems do their thinking?
Most AI inference happens when users ask questions, but what if models could think during idle time instead? This explores whether shifting inference to before queries arrive could fundamentally change system design.
MAI deepens this: precomputation is memory-amortized inference, where thinking happens by navigating prior trajectories rather than computing anew
-
Can agents learn from failure without updating their weights?
Explores whether language models can improve through trial and error by storing reflections in episodic memory rather than fine-tuning. This matters because it suggests a fundamentally different path to agent adaptation.
MAI provides a formal framework for why episodic memory enables learning: reuse of inference trajectories
-
Can semantic knowledge shift model behavior like reinforcement learning does?
Can textual descriptions of successful reasoning patterns, prepended as context, achieve the same distribution shifts that RL achieves through parameter updates? This matters because it could eliminate the need for expensive fine-tuning on limited data.
compatible: experience-as-prior is a form of memory-amortized inference
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Beyond Turing: Memory-Amortized Inference as a Foundation for Cognitive Computation
- Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering
- Rethinking Memory as Continuously Evolving Connectivity
- Artifacts as Memory Beyond the Agent Boundary
- Levels of Analysis for Large Language Models
- Useful Memories Become Faulty When Continuously Updated by LLMs
- The AI Hippocampus: How Far are We From Human Memory?
- AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Original note title
memory-amortized inference models cognition as navigation over constrained latent manifolds — the time-reversal dual of reinforcement learning