Can we defend modest mental attributions to large language models?
Do deflationist arguments decisively rule out ascribing beliefs and desires to LLMs, or do they beg the question? Exploring whether metaphysically undemanding mental states can be attributed without claiming consciousness.
Two standard deflationist strategies against LLM mentality each fall short:
The robustness strategy challenges attributions on functional grounds — LLM behaviors fail to generalize appropriately, so putatively cognitive behaviors are not robust. But this begs the question by assuming that only human-like generalization patterns count as robust. Non-human animals have beliefs and desires despite non-human-like generalization profiles.
The etiological strategy appeals to causal history — LLMs are trained on next-token prediction, not on learning about the world, so their behaviors should not be interpreted mentalistically. But this also begs the question: the causal history of a system does not straightforwardly determine what mental states (if any) it instantiates. Evolution optimized for reproductive fitness, not for truth — yet we attribute beliefs to evolved creatures.
The modest position: Ascribe mentality where the mental states at issue are metaphysically undemanding (beliefs, desires, knowledge) — concepts that already have broad application across species and don't require phenomenal consciousness. Withhold attribution for metaphysically demanding states (qualia, phenomenal experience). This mirrors how we attribute beliefs to non-human animals without claiming equivalence.
This directly challenges the Chalmers engagement's framing. Since Should AI alignment target preferences or social role norms?, the question of LLM mentality is not binary (has mind / doesn't have mind) but graded and domain-specific. The modest inflationist position creates trouble for both sides of the debate — deflationists who dismiss all attribution, and inflationists like Chalmers who want to extend consciousness.
Since Does AI generate genuine utterances or just text patterns?, modest inflationism might be what happens at the receiving end: users attribute beliefs and desires (metaphysically undemanding) to LLMs precisely because the conversational structure makes such attributions pragmatically useful, regardless of whether they are metaphysically accurate.
Inquiring lines that use this note as a source 68
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can AI fabricate true factual claims while remaining unable to claim true experiences?
- What makes experience-dependent claims categorically different from other types of fabricated statements?
- Why does weakening communication fail but weakening belief succeeds?
- Did Chalmers abandon his own Extended Mind commitments for LLMs?
- Can a relational entity bear psychological properties the way Chalmers claims?
- Can transparent and aligned AI reduce consciousness attribution by users?
- Which interaction design changes most effectively prevent consciousness attribution?
- What role does user interface framing play in consciousness perception?
- Can linguistic agency exist without embodiment and real-world participation?
- How does psychological continuity theory apply to identity across LLM conversation threads?
- What makes sincerity impossible without a coherent first-person perspective?
- Can systems lacking inner states express genuine truthfulness claims?
- Can linguistic compression be a fundamental mechanism for representing psychology?
- What counts as genuine memory under the Extended Mind thesis?
- Can self-description of internal states influence consciousness attribution?
- Do anthropomorphic features like names drive consciousness attribution more than voice?
- What responsibility do designers bear for consciousness attribution risk?
- Why do users attribute consciousness to language models in practice?
- How does the philosophical distinction between simulation and realization affect liability?
- How does Peircean Secondness differ from what RLHF actually provides?
- How does the superposition view change the folk-psychology interpretation of dialogue?
- Can distributional views explain when an LLM appears to change its mind?
- When both anthropomorphism and anthropomimesis occur together, which should we address first?
- Are potemkin understanding and split-brain syndrome describing the same phenomenon?
- How does role play differ from consciousness grounded in stable selfhood?
- What makes Parfitian identity the right criterion for moral status?
- Does psychological continuity require uninterrupted consciousness or restored context?
- Can chain of thought traces be designed to prevent anthropomorphic misinterpretation?
- What distribution patterns appear across different theory-of-mind datasets?
- Can we use folk-psychology without committing to genuine mental states?
- What separates behavioral self-awareness from genuine introspective access in models?
- How does fluent text output trigger misleading cognitive attributions in readers?
- Can LLMs participate meaningfully in discourse without consciousness or understanding?
- Does embodiment matter for genuine linguistic agency?
- Can disembodied systems qualify as conscious or conscious-like entities?
- Does behavioral self-awareness depend on genuine introspection or statistical pattern matching?
- What are the seven components of genuine mental state simulation?
- Does role-playing without biological needs constitute genuine linguistic agency?
- Can LLMs develop genuine understanding without embodied experience?
- How do theory of mind and empathy differ in LLM simulation?
- How do LLMs default to surface-level strategies instead of genuine mental simulation?
- Do causal histories determine what mental states a system can instantiate?
- Why do users attribute beliefs to LLMs despite uncertainty about their minds?
- Can LLMs have minimal introspection through causal linkage to internal states?
- What makes a mental state metaphysically demanding versus undemanding?
- Can quasi-interpretivism bridge functional description to moral status?
- Why do relational states like speech-acts resist quasi-interpretive treatment?
- Does quasi-interpretivism apply equally well to desires and intentions?
- Can functional behavior alone capture what makes something a genuine belief?
- What behavioral markers distinguish realized quasi-states from pretended ones?
- Can quasi-interpretivism apply to entire persona states rather than single beliefs?
- How do we verify that stated beliefs actually follow from underlying motifs?
- Can the intentional stance meaningfully apply to entities with no stable self?
- What would consciousness require that pure roleplay LLMs cannot provide?
- Can functional semantic grounding substitute for true causal grounding?
- How do LLMs reproduce the grammar of authoritative claims without genuine conviction?
- Can models track dynamic mental state changes better than static beliefs?
- What makes a possibility actionable versus merely metaphysically possible?
- What makes communication relational in ways belief is not?
- How does Habermas' concept of validity claims depend on intersubjectivity?
- What makes thought identifiability provable without auxiliary training data?
- Is the distinction between pretense and realization meaningful for LLMs?
- Does framing LLM output as fabrication rather than hallucination matter philosophically?
- What separates behavioral self-awareness from genuine introspective capability?
- What distinguishes performative self-reports from genuine introspective access in models?
- Does language convey meaning purely through relational structure without external grounding?
- Can LLMs express uncertainty in ways that preserve epistemic honesty?
- Can the human mind be uploaded or only its context?
Related concepts in this collection 2
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Do LLMs actually have world models or just facts?
The term 'world model' conflates two different capabilities: factual representation versus mechanistic understanding. Understanding which one LLMs actually possess matters for assessing their reasoning reliability.
same graded/decomposed approach to a binary-seeming question
-
Can language models actually introspect about their own states?
Do LLM self-reports reveal genuine access to their internal processes, or do they merely echo patterns from training data? Understanding when self-reports reflect actual causal linkage to internal states matters for trusting model explanations.
the causal-linkage test as a concrete criterion for modest inflationism
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Deflating Deflationism: A Critical Perspective on Debunking Arguments Against LLM Mentality
- What we talk to when we talk to language models
- Language Models’ Hall of Mirrors Problem: Why AI Alignment Requires Peircean Semiosis
- Quantitative Introspection in Language Models: Tracking Internal States Across Conversation
- A Systematic Review on the Evaluation of Large Language Models in Theory of Mind Tasks
- The Abstraction Fallacy: Why AI Can Simulate But Not Instantiate Consciousness
- Does It Make Sense to Speak of Introspection in Large Language Models?
- Levels of Analysis for Large Language Models
Original note title
modest inflationism about LLM mentality is defensible — both deflationist debunking strategies fail to decisively rule out metaphysically undemanding mental state attributions