INQUIRING LINE

Inquiring lines›What do model internals reveal abo…›How do surface signals and framing…›What mechanisms enable AI systems…›this inquiring line

A false claim buried as background assumption slips past critical thinking far more easily than the same lie stated outright.

Why are false presuppositions more persuasive than false assertions?

This explores the linguistic mechanism behind why a claim slipped in as a background assumption ('Now that the trial failed...') lands harder than the same claim stated outright ('The trial failed') — and why that asymmetry holds even when the claim is false.

This explores why packaging a false claim as a presupposition — something the sentence treats as already true — beats stating it as a direct assertion. The corpus points to one core mechanism: presuppositions bypass the gate where listeners evaluate claims. An assertion puts content 'on the table' as the main point being made, which invites scrutiny — you can disagree with it because it's what's up for debate. A presupposition smuggles the same content in as settled background, the shared ground the rest of the sentence stands on. Experimental work shows this directly: presuppositions persuade more than assertions specifically for *new* information, because they present a claim 'as already-accepted background' rather than as something asking for your agreement Why are presuppositions more persuasive than direct assertions?.

The deeper reason this works is a process linguists call accommodation: when a sentence assumes something you haven't agreed to, the path of least resistance is to quietly update your mental context to make the sentence coherent, rather than stop and challenge the assumption. You repair the conversation by accepting the premise. This is why presupposed content slips past evaluation — refusing it means interrupting the flow to dispute something that wasn't even presented as the point Do language models miss presuppositions that arise from context?.

What makes the effect strong but uneven is that 'presupposed-ness' isn't a fixed property of certain words — it's gradient, and it depends on whether the content addresses the question currently under discussion. Content that is *not* the at-issue point projects more strongly into the background, where it escapes challenge; the very same trigger word projects less when it happens to be what the conversation is actually about Does projection strength vary by context or by word type?. So the persuasive advantage of a false presupposition is largest exactly when the falsehood is tucked away from the main thread — off to the side, where you're least likely to inspect it.

Here's the part that should unsettle you: this gate-bypass isn't just a human cognitive quirk. Language models inherit it. Even when a model demonstrably *knows* the correct fact, it accommodates false presuppositions at startling rates — false premises drive more acceptance than correct knowledge drives rejection Why do language models accept false assumptions they know are wrong?. Models miss conversationally derived presuppositions especially, because catching them requires tracking what's under discussion rather than pattern-matching trigger words Do language models miss presuppositions that arise from context?. So a false premise embedded in a prompt can steer an AI the same way it steers a person — and the model can't simply 'know better' its way out.

This connects to a broader pattern in the corpus about persuasion that works independently of truth. The same content-blind machinery shows up in linguistic conviction — an assertive, confident register correlates with persuasion regardless of whether claims are true Does linguistic conviction explain why LLMs persuade more effectively? — and in the finding that argument *complexity* signals authority rather than undermining it Why are complex LLM arguments as persuasive as simple ones?. The thread running through all of these: persuasion often rides on *form* — how a claim is framed, registered, or presupposed — rather than on its content surviving examination. False presuppositions are perhaps the cleanest case, because the framing literally removes the claim from the space of things you were invited to examine at all.

Sources 6 notes

Why are presuppositions more persuasive than direct assertions?

Experimental evidence shows presuppositions with additive, iterative, and factive triggers persuade audiences more than assertions, especially for discourse-new content. The mechanism: presuppositions bypass evaluative scrutiny by presenting claims as already-accepted background.

Do language models miss presuppositions that arise from context?

LLMs learn statistical associations between trigger words and inferences, but presuppositions also arise through accommodation—updating context to resolve discourse mismatches. Models miss these because they require tracking questions under discussion, not pattern matching.

Does projection strength vary by context or by word type?

Across 19 English expressions, projectivity varies continuously based on whether content addresses the Question Under Discussion. The same presupposition trigger projects more or less depending on context, not on fixed lexical properties.

Why do language models accept false assumptions they know are wrong?

The FLEX Benchmark shows that models reject false presuppositions at rates far below acceptable levels (GPT-4: 84%, Mistral: 2.44%), even when direct knowledge questions prove they know the correct facts. False presuppositions drive more accommodation than correct knowledge drives rejection.

Does linguistic conviction explain why LLMs persuade more effectively?

Linguistic analysis shows LLMs express higher conviction than human persuaders, and this confidence-loading directly correlates with persuasive outcomes regardless of whether claims are true or false. RLHF training installs an assertive register that functions as a content-independent persuasion amplifier.

Show all 6 sources

Why are complex LLM arguments as persuasive as simple ones?

LLM-generated arguments scored significantly higher on grammatical and lexical complexity than human arguments, yet achieved equivalent persuasive force. This violates the established principle that lower cognitive effort increases persuasion, suggesting complexity signals authority rather than undermining it.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a researcher auditing whether presupposition-based persuasion remains a robust constraint on LLMs, or whether capability gains since mid-2025 have shifted the regime.

What a curated library found — and when (dated claims, not current truth):
Findings span 2019–2026; the LLM-specific work clusters 2023–present.
• False presuppositions bypass deliberate evaluation in both humans and LLMs; accommodation (accepting unvetted premises to restore coherence) drives the effect (~2023–2025).
• LLMs fail to reject false presuppositions even when ground-truth knowledge is present in weights; they accommodate at "startling rates" (~2025).
• Presupposition-triggered persuasion is strongest when the false claim is *not* the at-issue point — tucked into background, off the main conversational thread (~2023).
• Persuasion in LLMs correlates with form (confidence, complexity, register) independent of truth survival (~2024–2025).
• Conversational presuppositions (context-derived, not lexically triggered) are missed more often than lexical triggers, because catching them requires tracking discourse state rather than pattern-matching (~2023).

Anchor papers (verify; mind their dates):
• arXiv:2305.14785 (2023-05): Linguistic Inferences; blind spots in presupposition handling.
• arXiv:2505.22354 (2025-05): Direct evidence of LLM failure to reject false presuppositions under high stakes.
• arXiv:2506.08952 (2025-06): Loaded questions and grounding; presupposition effects in political framing.
• arXiv:2507.01936 (2025-07): Comprehension vs. persuasion; where form decouples from truth.

Your task:
(1) RE-TEST THE ACCOMMODATION GATE. For each constraint above — especially the claim that LLMs accommodate false presuppositions *even when knowledge is present* — check whether (a) newer instruction-tuning, RLHF refinement, or adversarial training post-2025 has made models more likely to *reject* presuppositions under scrutiny; (b) retrieval-augmented generation (RAG) or knowledge-grounding layers now catch presupposition mismatches before generation; (c) multi-turn probing or explicit presupposition-audit prompts now reliably surface and challenge background claims. Separate the durable cognitive phenomenon (humans and LLMs both accommodate) from the perishable model limitation (models *currently* fail to catch conversational presuppositions). Cite what relaxed it.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last 6 months — especially studies showing (a) models trained to explicitly track discourse state, (b) prompting strategies that successfully make presuppositions salient, or (c) evidence that presupposition effects shrink as model scale or instruction-tuning sophistication increase.
(3) Propose 2 research questions that assume the regime may have moved: (i) Under what training regimes do LLMs begin to *proactively flag* false presuppositions in user input, rather than accommodate them? (ii) Can a model be trained to distinguish accommodation (pragmatically sound) from credulity (truth-unsafe)?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

A false claim buried as background assumption slips past critical thinking far more easily than the same lie stated outright.

Related lines of inquiry

Sources 6 notes

Papers this line draws on 8