SYNTHESIS NOTE

Topics›Philosophy Subjectivity›this note

Can dialogue systems track both speakers' beliefs across turns?

Explores whether pragmatic reasoning frameworks can extend beyond single utterances to model how both conversation partners' understanding evolves. This matters because current dialogue systems lack principled ways to represent shared meaning-making.

Synthesis note · 2026-04-18 · sourced from Philosophy Subjectivity

The Rational Speech Act (RSA) framework models pragmatic reasoning as recursive social inference between speakers and listeners. But RSA has a fundamental limitation for dialogue: it handles single utterances, not evolving multi-turn conversations. CRSA fixes this by integrating a multi-turn gain function grounded in interactive rate-distortion theory.

The key extension: Both agents have private information. Each produces utterances conditioned on the full dialogue history. The gain function tracks evolving beliefs of both interlocutors — not just one listener inferring one speaker's intent, but bidirectional, progressive convergence of shared understanding.

Demonstrated on: referential games and template-based doctor-patient dialogues (disease diagnosis from symptoms). CRSA captures the progression from partial to shared understanding across turns.

A critical limitation acknowledged: there is no systematic way to model the meaning spaces, which are always application-dependent. And shifting from utterance-level to token-level reasoning (for scaling to real LLMs) may influence pragmatic capabilities — the reasoning granularity problem is unresolved.

This provides the mathematical framework that current LLM dialogue systems lack. Since the fluency gap — llm text is linguistically well-formed but communicatively empty because fluency substitutes for the grounding work that makes communication meaningful, CRSA offers a principled alternative: pragmatic reasoning grounded in information theory rather than next-token prediction. The question is whether token-level LLM generation can implement utterance-level pragmatic optimization.

Since Why do standard alignment methods ignore partner interventions?, CRSA's bidirectional belief tracking is the theoretical complement to the counterfactual invariance approach — one addresses it through reward engineering, the other through information-theoretic architecture.

Inquiring lines that read this note 73

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

How should dialogue systems represent uncertainty from noisy speech input?

Does conversational format create illusions of genuine AI communication?

Why do multi-turn conversations degrade AI intent and coherence?

How can language models sustain linguistic synchrony and intersubjectivity during dialogue?

How can LLM user simulators model realistic goal-driven conversation?

How do language models establish social grounding in human dialogue?

How do formal dialogue structures reveal conversation coherence mechanisms?

What makes dialogue-based explanation more successful than monologue?

How should conversational agents balance goal-driven initiative with user control?

Can LLM personas constitute genuine psychology or remain linguistic role-play?

How does the superposition view change the folk-psychology interpretation of dialogue?

How do chatbots affect human self-disclosure and emotional engagement?

Why does a chatbot's intersubjective stance differ functionally from Otto's extended-mind notebook?

How faithfully do LLMs reflect their actual reasoning in outputs and explanations?

What makes active reasoning through dialogue harder than passive reasoning?

What makes AI persuasion effective and how can we counter it?

Why should disagreement be treated as signal in collaborative reasoning?

How should dialogue recommender systems manage conversation history and state?

What distinguishes dynamic from static grounding in dialogue systems?

Why does static grounding prevent AI systems from supporting dialectical reconciliation?

How do standardized protocols improve coordination in multi-agent systems?

Can structured artifact sharing replace direct latent thought communication?

Is embodied interaction necessary for language meaning and genuine agency?

Can language meaning emerge without joint attention and shared embodied interaction?

How does latent reasoning compare to verbalized chain-of-thought?

Does chain-of-thought reasoning improve mental state tracking in dialogue?

Why do language models reinforce false assumptions instead of correcting them?

What mechanisms enable AI systems to generate and spread false beliefs?

How do conversation dynamics push models toward false beliefs?

How should dialogue systems best leverage conversation history for retrieval?

Can next-token prediction alone produce genuine language understanding?

Can statistical token processing create the accountability needed for dialogue?

Related concepts in this collection 2

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

13 direct connections · 122 in 2-hop network ·dense cluster Open in graph ↗

Can dialogue systems track both speakers' belief… Why do standard alignment methods ignore partner i… Why do models fail at asking good questions during…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Why do standard alignment methods ignore partner interventions? Standard RLHF and DPO optimize for token-level quality but may structurally prevent agents from meaningfully incorporating partner input. This explores whether the training objective itself blocks collaborative reasoning.
complementary approaches to the same problem: partner-awareness
Why do models fail at asking good questions during interaction? When models must actively seek information through questions rather than receive it passively, they struggle dramatically. This explores why GPT-4o plateaus at 35% accuracy and whether training or prompting can fix the underlying deficit.
CRSA's belief tracking could explain WHY active reasoning fails: without tracking what information has been gained, questioning degenerates

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

collaborative rational speech acts extend pragmatic reasoning to multi-turn dialogue by modeling evolving beliefs of both interlocutors through rate-distortion theory

Can dialogue systems track both speakers' beliefs across turns?

Inquiring lines that read this note 73

Related concepts in this collection 2

Related papers in this collection 8

Search by related questions 4