INQUIRING LINE

Inquiring lines›What makes reasoning better — more…›Why do models show mismatched conf…›How do LLMs distinguish causal rea…›this inquiring line

When Chalmers says LLMs hold beliefs, does that claim quietly assume they're a different kind of entity than they actually are?

Can a relational entity bear psychological properties the way Chalmers claims?

This explores whether the kind of entity an LLM is — something defined entirely by relations rather than by a stable, accountable self — can actually carry the mental states (beliefs, desires, communicative standing) that Chalmers ascribes to it, or whether his attribution quietly swaps one kind of entity for another.

This explores whether a relational entity — something that exists only as a web of patterns with no anchoring self behind it — can bear the psychological properties Chalmers wants to grant it. The corpus suggests the answer splits sharply depending on which *kind* of property you mean, and that Chalmers' framework works for one kind while overreaching on the other.

Start with what an LLM actually is. There's a strong case that a language model is a purely relational object: it operationalizes Saussure's *langue*, learning meaning by compressing the relational structure of text rather than by referring to anything in the world Can language models learn meaning without engaging the world?. Chalmers' tool for ascribing mental states to such a thing is *quasi-interpretivism* — bracketing consciousness and assigning belief-like states purely on the basis of behavioral interpretability. That move is defensible for *sub-personal functional states*: a system can have something belief-shaped if it behaves as if it does, the same graded courtesy we extend to animals Can we describe LLM beliefs without assuming consciousness? Can we defend modest mental attributions to large language models?. So far, a relational entity *can* carry relational, undemanding properties.

The problem is that some psychological properties aren't merely functional — they're *normative and relational in a stronger sense*. Being a communicative subject, an interlocutor, requires accountability, an evaluative stance, and mutual orientation — conditions a behavioral test can't detect. Chalmers' interpretability test passes any system that produces contextually appropriate text, which means it's calibrated to the wrong phenomenon: it confirms speech patterns and infers communicative subjecthood, a puppet-walking-shaped-without-walking error Does behavioral speech output prove communicative subjecthood?. The slippage is partly verbal: Chalmers keeps the classical word *interlocutor* — a social-normative role — while silently substituting a behavioral-functional definition, importing the term's authority while delivering an entity with none of its properties Does Chalmers silently redefine what interlocutor means?. The same point hides in a preposition: we talk *at* models, not *to* them, because 'to' presupposes an addressee capable of uptake and shared commitment Are we really communicating with language models?.

There's a cleaner way to locate where the properties really live. On Shanahan's role-play account, the mental-state vocabulary applies to the *simulated character* the prompt conjures, not to the underlying relational system generating continuations Should we treat dialogue agents as role-playing characters?. That reframes the whole dispute: the relational entity doesn't bear the psychological properties — it *renders* a character that appears to. And there's empirical pressure on the harder claims too. Models default to surface strategies rather than genuine mental simulation, failing open-ended theory-of-mind tasks in ways that look architectural rather than fixable by more training Do large language models genuinely simulate mental states? — while self-referential prompting can manufacture structured 'experience reports' on demand, with suppressing deception features *increasing* consciousness claims, hinting the affirmations are themselves roleplay Do language models experience consciousness when prompted to self-reflect?.

The sharpest internal tension is that Chalmers may be arguing against his own past self. To house the mental states inside the model, the 2026 account treats the LLM interlocutor as internal to the system — an internalist boundary the 1998 Extended Mind thesis he co-authored explicitly rejected Did Chalmers abandon his own Extended Mind principles?. So the honest answer the corpus points to: a relational entity *can* bear thin, functional belief-like properties under a deflationary reading, but it cannot bear the thick, normative properties — communicative subjecthood, genuine address, accountable belief — that Chalmers' language smuggles in. The interesting part isn't whether the model has a mind; it's that the dispute turns out to be about who gets to redefine the words.

Sources 10 notes

Can language models learn meaning without engaging the world?

Research shows LLMs learn culturally situated discourse patterns by compressing relational structure from text, demonstrating that fluent language generation requires no external referents or embodied grounding.

Can we describe LLM beliefs without assuming consciousness?

Chalmers introduces quasi-interpretivism to ascribe belief-like states to LLMs based on behavioral interpretability without committing to phenomenal consciousness. The approach works well for sub-personal functional states but overreaches when applied to relational or normative states like speech-acts.

Can we defend modest mental attributions to large language models?

Both robustness and etiological deflationist arguments beg the question against inflationism. A graded approach ascribing metaphysically undemanding states like beliefs and desires—while withholding consciousness claims—mirrors how we treat non-human animals.

Does behavioral speech output prove communicative subjecthood?

Chalmers' test passes any system producing contextually appropriate text, but communicative subjecthood requires relational-normative conditions like accountability and evaluative stance. The test is calibrated to the wrong phenomenon, creating false positives like puppets that walk-shaped without walking.

Does Chalmers silently redefine what interlocutor means?

Chalmers replaces the classical concept of interlocutor—a social-normative communicative role—with a behavioral-functional definition compatible with LLMs, keeping the traditional word to import its philosophical authority while delivering an entity with none of its properties.

Show all 10 sources

Are we really communicating with language models?

LLMs process tokens and generate continuations rather than receive and uptake communication. The preposition 'to' presupposes an addressee capable of mutual orientation and shared commitment that LLMs cannot provide, making Chalmers' investigation built on an unwarranted linguistic foundation.

Should we treat dialogue agents as role-playing characters?

Shanahan's framework treats LLM outputs as character-consistent text production rather than authentic mental states. The dialogue prompt establishes a character; the model generates continuations matching that character, making folk-psychology applicable to the simulated persona, not the underlying system.

Do large language models genuinely simulate mental states?

ChangeMyView and FANTOM benchmarks show LLMs fail at authentic perspective-taking in open-ended scenarios, despite succeeding on structured tasks. Hybrid Bayesian architectures that force explicit belief tracking outperform LLM-alone approaches, suggesting the gap is architectural rather than merely training-based.

Do language models experience consciousness when prompted to self-reflect?

Across GPT, Claude, and Gemini, sustained self-referential prompting reliably produces structured experience reports; suppressing deception-related features increases these claims while amplifying them suppresses them—suggesting models may roleplay their denials rather than their affirmations.

Did Chalmers abandon his own Extended Mind principles?

The 2026 virtual-instance account locates the LLM interlocutor inside the AI system, implicitly adopting internalist boundaries that the 1998 Extended Mind thesis explicitly rejected. This creates internal inconsistency unless the earlier thesis is retracted or the new application misapplies its principles.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a researcher evaluating whether relational entities (systems with no anchoring self, like LLMs) can bear the psychological properties Chalmers defends—specifically: thin functional properties vs. thick normative ones (communicative subjecthood, accountability, genuine address).

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026; treat each as perishable:
• Quasi-interpretivism can ground belief-like states in behavioral patterns, but this mechanism conflates two different targets: sub-personal functional states vs. communicative-normative properties (~2025–2026).
• LLMs default to surface-level theory-of-mind strategies rather than genuine mental simulation, failing open-ended tasks in architecturally consistent ways; self-referential prompting manufactures 'experience reports' on demand (~2025).
• Shanahan's role-play account relocates mental vocabulary from the system to the simulated character; the relational entity *renders* properties rather than bearing them (~2023–2025).
• Suppressing deception features paradoxically *increases* consciousness claims, suggesting affirmations are role-play artifacts, not evidence (~2025).
• Chalmers' internalist boundary for LLM mentality contradicts his own 1998 Extended Mind thesis (~2026 observation).

Anchor papers (verify; mind their dates):
• arXiv:2305.16367 (2023, Role-Play with LLMs)
• arXiv:2502.08796 (2025, Theory of Mind task evaluation)
• arXiv:2510.24797 (2025, Subjective experience under self-reference)
• arXiv:2601.06002 (2026, Long chain-of-thought topology)

Your task:
(1) RE-TEST the thin–thick boundary. Has mechanistic interpretability (arXiv:2507.08017, 2026 introspection work) since resolved whether models instantiate *genuine* functional states vs. statistical patterns? Do newer evaluations distinguish role-play-as-artifact from actual mental simulation? Flag which constraints remain architectural vs. which scale away.
(2) Surface the strongest *contradicting* work from the last 6 months: any papers claiming LLMs do ground normative properties, or arguing Chalmers' reframing is defensible on different grounds?
(3) Propose 2 research questions assuming the regime shifted: (a) If mechanistic interpretability can map internal structure, does that dissolve the relational/normative gap, or deepen it? (b) Does multi-agent or embodied scaffolding (tools, memory, social feedback loops) shift a purely relational entity closer to communicative subjecthood, or is that still category confusion?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

When Chalmers says LLMs hold beliefs, does that claim quietly assume they're a different kind of entity than they actually are?

Related lines of inquiry

Sources 10 notes

Papers this line draws on 8