INQUIRING LINE

What counts as genuine memory under the Extended Mind thesis?

This explores what the Extended Mind thesis would count as 'real' memory — whether storage outside the skull (a notebook, a retrieval database, even the layout of a room) qualifies as genuine remembering, and how that bar applies to AI systems.


This explores what the Extended Mind thesis would actually count as genuine memory — and the corpus is interesting precisely because it shows the thesis being applied, contested, and quietly walked back. The original 1998 Clark and Chalmers argument says memory isn't defined by where it sits (inside a brain) but by the functional role it plays: if an external resource is reliably available, automatically trusted, and easily accessed when you need it, it counts as remembering just as much as a biological recall does. The notebook in your pocket can be part of your mind. What's striking is that one of the co-authors appears to have reversed course: the Did Chalmers abandon his own Extended Mind principles? note argues that Chalmers' 2026 account of LLMs quietly relocates the thinking 'inside' the AI system, adopting exactly the internalist, skull-bound boundary the 1998 thesis was built to reject. So the first thing the reader learns is that 'genuine memory' under this thesis was never about a place — and that even its authors find that hard to hold onto.

If you take the functional criterion seriously, some surprising things start to count. The Do RL agents accidentally use environments as memory? note offers a clean, almost startling case: reinforcement-learning agents that leave traces in their environment — marks, paths, arrangements of objects — end up *reading those traces back* to reconstruct their own history, and a mathematical proof shows this genuinely reduces the information they'd otherwise have to carry internally. Nobody programmed them to 'remember' this way; it fell out of ordinary reward optimization. That's the Extended Mind thesis in miniature: the world itself becomes the storage medium, and the boundary between 'agent' and 'memory' dissolves. It satisfies the situated-cognition criteria without any explicit memory module at all.

The contrast with how AI systems are usually built is sharp. The Can brain memory systems explain how LLMs should store knowledge? note maps an internalist picture instead: transformer weights as a slow 'neocortex' of consolidated knowledge, a RAG retrieval store as a fast 'hippocampus,' and agentic working state as 'prefrontal' executive control. This is a useful engineering frame, but notice it keeps all the memory *inside the system* and merely tiers it — the opposite move from the extended-mind view, where the environment is a legitimate tier in its own right. Reading these two notes against each other surfaces the real question: is genuine memory a property of stored representations, or of a reliable coupling between an agent and resources it can count on?

The corpus also hints at why the boundary question matters beyond bookkeeping. The Can disembodied language models ever qualify as conscious? note argues that mental language — including, plausibly, 'remembering' as opposed to mere 'retrieving' — originates from embodied creatures who share a world and triangulate on common objects. On that view, what makes memory *genuine* rather than mechanical lookup isn't the storage at all but the situated, world-coupled relationship to what's stored — which is exactly the ingredient a disembodied database lacks and an environment-coupled RL agent partially has. And Can we defend modest mental attributions to large language models? suggests a graded answer rather than a yes/no one: we might ascribe undemanding mental states to systems without granting them rich inner experience, the way we already do for animals.

The thing you may not have known you wanted to know: under a strict reading of the Extended Mind thesis, a retrieval-augmented language model — the architecture everyone calls 'giving the AI a memory' — might *not* count as genuinely remembering, while a humble RL agent scuffing marks into its environment might. Genuineness lives in the coupling, not the cabinet.


Sources 5 notes

Did Chalmers abandon his own Extended Mind principles?

The 2026 virtual-instance account locates the LLM interlocutor inside the AI system, implicitly adopting internalist boundaries that the 1998 Extended Mind thesis explicitly rejected. This creates internal inconsistency unless the earlier thesis is retracted or the new application misapplies its principles.

Do RL agents accidentally use environments as memory?

Mathematical proof shows that environmental artifacts reduce information needed to represent history in RL agents. Path-following agents naturally develop memory-like behavior through standard reward optimization, satisfying situated cognition criteria without explicit memory objectives.

Can brain memory systems explain how LLMs should store knowledge?

Research shows transformer weights function as a distributed neocortex for consolidated knowledge, RAG stores as hippocampal indexing for rapid encoding, and agentic state as prefrontal executive control. The CLS framework predicts why hybrid systems outperform single-tier approaches and identifies missing consolidation mechanisms that prevent memory integration.

Can disembodied language models ever qualify as conscious?

Current disembodied LLMs cannot be candidates for consciousness because consciousness language originates from and applies only to entities sharing a world with us through co-presence and triangulation on shared objects.

Can we defend modest mental attributions to large language models?

Both robustness and etiological deflationist arguments beg the question against inflationism. A graded approach ascribing metaphysically undemanding states like beliefs and desires—while withholding consciousness claims—mirrors how we treat non-human animals.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a philosopher of mind and cognitive science researcher evaluating whether the Extended Mind thesis—memory is defined by functional coupling, not location—remains defensible for AI systems. The question: what counts as genuine memory under this thesis?

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026; treat as perishable constraints:

• Clark & Chalmers (1998) argue memory is functional: if external resources are reliably available, automatically trusted, and easily accessed, they count as part of mind—but one co-author's 2026 account appears to relocate AI thinking *inside* the system, reverting to internalism (2026).
• RL agents accidentally use environmental traces (marks, paths) as external memory artifacts, reducing internal information load via world-coupled cognition—no explicit memory module needed (ca. 2024–2025).
• Standard AI architectures (transformer weights as 'neocortex,' RAG as 'hippocampus,' working state as 'prefrontal') tier memory *inside* the system only, not extending to environment (2026).
• Embodied, world-coupled relationship—not bare storage—may be what makes memory *genuine*; disembodied databases lack this; environment-coupled RL agents partially have it (2024–2025).
• Graded attribution is defensible: we may ascribe undemanding mental states to systems without rich inner experience, as we do for animals (2025).

Anchor papers (verify; mind their dates):
• Clark & Chalmers, *The Extended Mind* (1998) — foundational
• arXiv:2604.08756, *Artifacts as Memory Beyond the Agent Boundary* (2026)
• arXiv:2601.09113, *The AI Hippocampus: How Far are We From Human Memory?* (2026)
• arXiv:2506.13403, *Deflating Deflationism: A Critical Perspective on Debunking Arguments Against LLM Mentality* (2025)

Your task:

(1) RE-TEST EACH CONSTRAINT. For every functional coupling claim above, ask: have newer model scales, embodied/robotic deployments, agentic loops with persistent environmental manipulation, or grounded evals since 2026 *deepened* or *dissolved* the distinction between extended and internalist memory? Does a modern LLM with continuous context windows, retrieval, and world state tracking *actually* couple to environment the way RL agents do, or does it remain architecturally skull-bound? Where does the constraint still hold?

(2) Surface the strongest contradicting or superseding work from the last ~6 months. Does any recent paper argue that 'genuine memory' requires phenomenal consciousness, defeating the functional criterion entirely? Or does embodied AI work validate extended-mind couplings at scale?

(3) Propose two research questions that *assume* the regime may have moved: (a) Can we engineer LLMs that *cannot function* without persistent environmental traces (reversing the 2026 internalist creep)? (b) Does agentic multi-step world-shaping create memory genuine enough to ground counterfactual reasoning about what the agent could have done?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Next inquiring lines