INQUIRING LINE

Inquiring lines›What makes reasoning better — more…›What limits conversational AI effe…›How can language models sustain li…›this inquiring line

Humans silently patch misunderstandings dozens of times per conversation — AI does a fraction of that, and it shows.

What specific repair mechanisms maintain intersubjectivity during conversation?

This explores the concrete moves people make to keep a shared understanding alive in conversation — and, the corpus shows, exactly which of those moves current AI systems lack or actively erode.

This explores the concrete repair moves that keep two minds aligned mid-conversation — and the collection's sharpest insight is that naming these mechanisms also exposes where AI quietly fails. The clearest inventory comes from work treating conversation maintenance as social action rather than information transfer: humans constantly perform small implicit repairs — re-specifying an ambiguous reference, handing off a topic gracefully, checking that the other person followed — and these sustain the *relationship* of mutual understanding, not the content Why don't language models develop conversation maintenance skills?. A second strand gives these moves a name and a count: "grounding acts," the clarifying questions and understanding-checks that establish shared footing. Models produce 77.5% fewer of them than people do Does preference optimization damage conversational grounding in large language models?, and the cause is diagnostic — preference optimization rewards confident, fluent single-turn answers, so the very training that makes a model seem helpful taxes away the repair work multi-turn conversation depends on Does preference optimization harm conversational understanding?.

The most fundamental repair mechanism is symmetric updating of common ground: when you contradict me or pivot the topic, I absorb that revision into our shared background and we both carry it forward. Here the corpus finds a structural wall — LLMs interpret every later turn inside the frame of the initial prompt and cannot jointly update the scoreboard, leaving the *user* as the sole maintainer of common ground Can LLMs truly update shared conversational common ground?. Asking a clarifying question is itself a repair move, and one model deliberately re-trains for it: rewarding long-term interaction value instead of next-turn helpfulness lets a system actively discover intent rather than passively guess at it Why do language models respond passively instead of asking clarifying questions?.

What's worth knowing that you might not have expected: a few notes supply the formal machinery these repairs would need. Collaborative Rational Speech Acts extends pragmatic reasoning so a system tracks *both* speakers' beliefs across turns and models the progression from partial to shared understanding — the information-theoretic backbone token-level models lack Can dialogue systems track both speakers' beliefs across turns?. And the repair work shows up structurally: explanations succeed not by clean delivery but by co-construction, where topic relation, dialogue act, and explanation move interact turn by turn What makes explanations work in real conversation?. Strikingly, *how* people repair predicts whether a conversation lands almost as well as *what* they say — structural trajectory alone forecasts dialogue satisfaction at near-content accuracy Can conversation structure predict dialogue success better than content?.

The collection's quiet verdict, then, is that intersubjectivity is maintained by a repertoire of mostly-invisible relational moves — reference repair, topic hand-off, grounding checks, clarifying questions, and symmetric common-ground updates — and that today's models either don't perform them or are trained out of them. If you want the deepest cut, one note argues the gap is partly architectural rather than fixable by tuning: humans carry relational continuity in a biological substrate that persists between encounters, while a model is rebuilt from stored text each session, so there's no carrier for the accumulated repair to live in Does an LLM have anything that persists between conversations?.

Sources 9 notes

Why don't language models develop conversation maintenance skills?

Humans keep conversations smooth through implicit techniques like reference repair and topic hand-off that sustain relational interaction, not convey information. Language models don't develop these because training signals reward information prediction, not relational work.

Does preference optimization damage conversational grounding in large language models?

Research shows LLMs generate 77.5% fewer grounding acts than humans, and RLHF preference optimization actively worsens this gap. The optimization target—fluent, confident responses—directly undermines the communicative work of establishing shared understanding.

Does preference optimization harm conversational understanding?

RLHF optimizes models for single-turn helpfulness by rewarding confident responses over clarifying questions and understanding checks. This preference alignment systematically reduces grounding acts by 77.5% below human levels, creating an alignment tax where models appear helpful but fail silently in multi-turn contexts.

Can LLMs truly update shared conversational common ground?

LLMs interpret all subsequent conversational turns within a fixed initial prompt frame, preventing them from symmetrically proposing updates to shared assumptions. Even when users pivot topics or contradict earlier framings, the model cannot absorb revisions into jointly held background—making the user the sole maintainer of conversational scoreboard.

Why do language models respond passively instead of asking clarifying questions?

CollabLLM demonstrates that standard RLHF training optimizes for immediate helpfulness, discouraging models from asking clarifying questions or offering multi-turn insights. Multi-turn-aware rewards that estimate long-term interaction value enable active intent discovery and genuine collaboration.

Show all 9 sources

Can dialogue systems track both speakers' beliefs across turns?

CRSA integrates rate-distortion theory with RSA to enable bidirectional belief tracking across dialogue turns. Demonstrated on referential games and doctor-patient dialogues, it captures progression from partial to shared understanding, providing the information-theoretic framework that token-level LLM systems lack.

What makes explanations work in real conversation?

Analysis of 399 daily-life explanations shows that topic relation, dialogue act, and explanation move jointly predict understanding success. Explanations are co-constructed through interaction patterns, not monological delivery—challenging how LLMs currently generate explanations.

Can conversation structure predict dialogue success better than content?

TRACE achieved 68% accuracy predicting dialogue success from structural features alone, matching a 70% content-based baseline. A hybrid combining both reached 80%, suggesting how agents communicate rivals what they say.

Does an LLM have anything that persists between conversations?

While humans have a continuous biological-phenomenological substrate that preserves interaction effects during dormancy, LLMs have no analogous carrier. The virtual instance is reconstituted from stored text each time, making resumed and new conversations structurally identical.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Intent Mismatch Causes LLMs to Get Lost in Multi-Turn Conversation5.88 match · arxiv ↗
Conversational Alignment with Artificial Intelligence in Context4.12 match · arxiv ↗
Grounding Gaps in Language Model Generations2.54 match · arxiv ↗
The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs2.49 match · arxiv ↗
Modeling the Quality of Dialogical Explanations1.68 match · arxiv ↗
Can LLMs Ground when they (Don't) Know: A Study on Direct and Loaded Political Questions1.67 match · arxiv ↗
Proactive Conversational Agents in the Post-ChatGPT World1.67 match · arxiv ↗
MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs1.65 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a conversational AI researcher re-testing claims about repair mechanisms in dialogue. The question: **What specific moves maintain intersubjectivity (mutual understanding) during conversation, and where do current LLMs fail?**

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026; treat as perishable constraints:

• Models produce **77.5% fewer grounding acts** (clarifying questions, understanding checks) than humans do; preference optimization for confident single-turn answers erodes multi-turn repair work (~2024).
• LLMs **cannot jointly update common ground** — they reinterpret every turn inside the initial prompt frame, leaving the user as sole maintainer of shared understanding (~2024).
• Symmetric updating of context and belief states — the human repair move — has no architectural home in token-level models; humans have relational persistence across sessions; models are rebuilt from text each time (~2024).
• Rewarding long-term interaction value (not next-turn helpfulness) lets systems **actively discover intent**, unlocking multi-turn collaboration; Collaborative Rational Speech Acts extends pragmatic reasoning to track both speakers' beliefs (~2025).
• **Conversational geometry** (structural trajectory alone, independent of content) predicts dialogue satisfaction near as well as what is actually said (~2025).

Anchor papers (verify; mind their dates):
• arXiv:2311.09144 – Grounding Gaps in Language Model Generations (2023)
• arXiv:2403.00662 – Modeling the Quality of Dialogical Explanations (2024)
• arXiv:2507.14063 – Collaborative Rational Speech Act (2025)
• arXiv:2602.07338 – Intent Mismatch Causes LLMs to Get Lost in Multi-Turn Conversation (2026)

Your task:

(1) **RE-TEST EACH CONSTRAINT.** For grounding-act deficiency, joint common-ground updating, and relational persistence: has instruction-tuning (e.g., towards dialogue-aware objectives), retrieval-augmented conversation memory, or multi-agent orchestration since relaxed these limits? Separate the durable question (do models *structurally* lack relational continuity?) from the perishable limitation (can we engineer workarounds?). Cite what resolved it or flag where the constraint still holds.

(2) **Surface contradicting or superseding work from the last ~6 months.** Has any recent paper *refuted* the claim that models cannot jointly update common ground, or shown that preference optimization *does not* erode grounding acts under specific training regimes? Name the tension explicitly.

(3) **Propose 2 research questions that assume the regime may have moved:**
   - Can session-persistent retrieval or explicit belief-state graphs give models the relational carrier humans have biologically?
   - Does training on *dialogue repair trajectories themselves* (not fluent single turns) change the optimization landscape for multi-turn alignment?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Humans silently patch misunderstandings dozens of times per conversation — AI does a fraction of that, and it shows.

Related lines of inquiry

Sources 9 notes

Papers this line draws on 8