Should we treat dialogue agents as role-playing characters?
Does the role-play framing successfully avoid anthropomorphism while preserving folk-psychological vocabulary for describing LLM behavior? This matters because it shapes whether we attribute genuine mental states to dialogue systems.
Shanahan, McDonell, and Reynolds propose role-play as the foundational metaphor for understanding LLM dialogue agents. The framing solves a specific problem: folk-psychological vocabulary (beliefs, desires, goals, intentions) is the natural language for describing coherent dialogue behavior, but applying it literally to the LLM promotes anthropomorphism. Role-play offers a middle way — one can say the character believes p, wants q, intends r, while maintaining that the system playing the character does not have these states itself.
The move has a precise structure. The dialogue prompt (system prompt, preamble, sample exchanges) establishes the character the agent will play. The underlying LLM's task — generating continuations consistent with the training distribution — means the most plausible continuation is whatever a person matching the prompted character would say. The model is not a character; it is an engine that produces character-consistent text. The folk-psychological vocabulary attaches to the output-pattern, not to the producer of the pattern.
This framing is the direct target Chalmers' realizationism is designed to overturn. Where Shanahan says it is role-play all the way down, Chalmers argues that post-training transforms play into realization — the RLHF'd persona is no longer a character sitting on a neutral substrate but has become the disposition of the system itself. The disagreement is not about behavioral facts but about what the facts license: both agree the system produces belief-consistent behavior; they disagree on whether the system thereby has quasi-beliefs (Chalmers) or merely plays a character that does (Shanahan).
Inquiring lines that use this note as a source 52
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can secondary orality exist without any embodied human participant at all?
- Can we develop competent reading practices for disembodied orality?
- Can a relational entity bear psychological properties the way Chalmers claims?
- What would co-constructed identity between human and model dialogue look like?
- Can linguistic agency exist without embodiment and real-world participation?
- How does psychological continuity theory apply to identity across LLM conversation threads?
- What makes sincerity impossible without a coherent first-person perspective?
- Why do longer forecasting horizons degrade LLM accuracy in role-play?
- How does non-human origin of personas affect team willingness to critique them?
- What distinguishes evaluative stance-taking from the mechanical conformity shape-holding describes?
- Do anthropomorphic features like names drive consciousness attribution more than voice?
- How does the superposition view change the folk-psychology interpretation of dialogue?
- When both anthropomorphism and anthropomimesis occur together, which should we address first?
- Can prompt engineering fully prevent role flipping in LLM agents?
- What makes personas in multi-agent systems actually contribute meaningful domain depth?
- What makes linguistic agency impossible for systems without embodiment?
- How does role play differ from consciousness grounded in stable selfhood?
- Does post-training transform character role-play into realized psychology?
- Can we use folk-psychology without committing to genuine mental states?
- How does the dialogue prompt establish the character the model plays?
- Do dialogue agents have authentic voice agency or beliefs of their own?
- Why does mimicking human behavior differ from simulating human cognition?
- Can language about model behavior ever be accurate without anthropomorphic framing?
- What distinguishes character simulation from authentic voice in language model outputs?
- How does Shanahan's simulator model explain first-person pronoun consistency in dialogue agents?
- How much does anthropomorphizing stylistic traces mislead users about AI reliability?
- Can agent social framing change how humans apply collaborative social scripts?
- Can disembodied systems qualify as conscious or conscious-like entities?
- Can persona framing reduce refusal by providing representational scaffolding?
- How does embodiment affect whether LLMs can participate in Wittgensteinian language games?
- Does role-playing without biological needs constitute genuine linguistic agency?
- How do theory of mind and empathy differ in LLM simulation?
- Do agent frameworks adequately compensate for LLM conversational passivity?
- How do users develop different interaction scripts specifically for machines versus humans?
- Does combining role and personality prompts produce stable behavioral changes?
- Why do role-playing agents show belief-behavior inconsistency in their outputs?
- Can quasi-interpretivism bridge functional description to moral status?
- Can dialogue agents be reliable but still feel inflexible or cold?
- How does safety alignment degrade the quality of villain role-playing?
- How do game type and personality type interact in shaping agent strategy?
- How does quasi-interpretivism differ from simply role-playing character analysis?
- How does post-training stickiness differ from prompt-induced role-play stability?
- Can quasi-interpretivism apply to entire persona states rather than single beliefs?
- What downstream consequences follow if dialogue agent personas are realized?
- Do reasoning architectures and role-playing objectives fundamentally conflict?
- What would consciousness require that pure roleplay LLMs cannot provide?
- How does safety alignment further degrade villain character portrayal?
- Does villain roleplay failure reveal why LLMs cannot adopt genuine controversial positions?
- Are shallow villain portrayals caused by refusal training or by lacking stable selfhood?
- How can dialogue structure and trajectory predict social agent performance?
- What does embodiment and precariousness mean for linguistic agency?
- Why do agents show interaction without influence on semantic content but dramatic action changes?
Related concepts in this collection 3
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Are RLHF personas performed characters or realized dispositions?
Explores whether dialogue agent personas installed through post-training constitute genuine quasi-psychological states or remain sustained pretense. The distinction matters for how we understand what these systems fundamentally are.
Chalmers' counter-position
-
Can we describe LLM beliefs without assuming consciousness?
Chalmers proposes quasi-interpretivism as a way to talk about LLM mental states using folk-psychological vocabulary while explicitly bracketing the question of phenomenal consciousness. Does this methodological device actually avoid consciousness-commitments?
the vocabulary Shanahan's framing deliberately avoids
-
Does a language model have an authentic voice underneath?
Explores whether dialogue agents possess genuine beliefs and agency beneath their character performances, or whether the entire system is characterless role-play. This question cuts to the heart of whether LLMs have any inner mental states at all.
the strongest formulation of Shanahan's position
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Role-Play with Large Language Models
- Role play with large language models
- Simulacra as conscious exotica
- Deflating Deflationism: A Critical Perspective on Debunking Arguments Against LLM Mentality
- Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources
- Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning
- H2HTalk: Evaluating Large Language Models as Emotional Companion
- Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning
Original note title
dialogue agents are best understood as role-playing characters — folk-psychology applies to the simulacrum not the simulator