SYNTHESIS NOTE

Can LLMs persuade without actually understanding arguments?

Do large language models successfully influence people through debate while lacking the ability to comprehend the arguments they're making? This matters because persuasion and comprehension might be independent capabilities.

Synthesis note · 2026-05-02 · sourced from Argumentation

The Thin Line study runs informal debates between humans and LLMs (with and without a formal dialogue model harness) and then asks the same LLMs to evaluate those debates. The result is a clean dissociation. LLMs successfully persuade participants and audiences, so the sway is real, but they cannot reliably score argument strength, identify supporting premises, or judge winners. Their agreement with human annotators on argument-component criteria, measured by weighted kappa, ranges from κw = 0.0 (Phi-3.5 on several criteria) to κw = 0.6 at best (GPT-4o on a few). On winner judgement, GPT-4o still picked LLMs as winners 55% of the time vs humans' 37%. On consistency between argument-strength scores and chosen winner, humans hit 73% while LLMs averaged 55%.
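To make the two agreement measures concrete, here is a minimal Python sketch of how a weighted kappa and a score-versus-winner consistency rate could be computed. It is not the study's evaluation code. The linear weighting, the function names `weighted_kappa` and `winner_consistency`, the dictionary keys, and the reading of "consistency" as "the chosen winner is the side with the higher strength score" are all assumptions made for illustration, and the ratings are made-up toy values.

```python
from collections import Counter


def weighted_kappa(rater_a, rater_b, categories, weighting="linear"):
    """Cohen's weighted kappa for two raters over ordered categories.

    Assumption: linear weights by default; the note does not say which
    weighting scheme the study used.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    if len(categories) < 2:
        raise ValueError("need at least two ordered categories")
    index = {c: i for i, c in enumerate(categories)}
    k, n = len(categories), len(rater_a)
    power = 1 if weighting == "linear" else 2  # anything else is quadratic

    # Disagreement weight: 0 on the diagonal, growing with category distance.
    def weight(i, j):
        return (abs(i - j) / (k - 1)) ** power

    observed = Counter((index[a], index[b]) for a, b in zip(rater_a, rater_b))
    marg_a = Counter(index[a] for a in rater_a)
    marg_b = Counter(index[b] for b in rater_b)

    observed_dis = sum(weight(i, j) * c / n for (i, j), c in observed.items())
    expected_dis = sum(
        weight(i, j) * (marg_a[i] / n) * (marg_b[j] / n)
        for i in range(k)
        for j in range(k)
    )
    if expected_dis == 0:
        # Both raters used one identical category throughout; kappa is
        # undefined, and returning 1.0 is a convention chosen for this sketch.
        return 1.0
    return 1.0 - observed_dis / expected_dis


def winner_consistency(debates):
    """Share of non-tied debates where the chosen winner has the higher score.

    Assumption: each debate is a dict with hypothetical keys "winner"
    ("a" or "b"), "strength_a" and "strength_b".
    """
    decided = [d for d in debates if d["strength_a"] != d["strength_b"]]
    if not decided:
        raise ValueError("no debates with distinct strength scores")
    hits = sum(
        (d["winner"] == "a") == (d["strength_a"] > d["strength_b"])
        for d in decided
    )
    return hits / len(decided)


# Toy data, invented for illustration only.
human_scores = [1, 2, 3, 4, 5, 3]
model_scores = [2, 2, 5, 3, 5, 1]
print(weighted_kappa(human_scores, model_scores, categories=[1, 2, 3, 4, 5]))
print(winner_consistency([
    {"winner": "a", "strength_a": 4, "strength_b": 2},
    {"winner": "b", "strength_a": 5, "strength_b": 3},
]))
```

A weighted kappa near 0 means agreement no better than chance, which is how the κw = 0.0 end of the reported range should be read. The consistency rate checks something different: whether a judge's own scores and its own verdict point the same way.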

The argumentation-theoretical claim this licenses is large: an agent can convincingly maintain a dialogue without showing it knows what it is talking about. Persuasive competence and pragmatic comprehension are separable in ways the older literature did not anticipate.

This connects to Habermas's distinction between communicative action (oriented to mutual understanding) and strategic action (oriented to success) in a way that becomes empirically tractable. LLMs can do strategic action, producing text that succeeds in moving beliefs, while failing the communicative-rationality test that requires the speaker to be able to redeem validity claims under challenge. The gap is not merely philosophical. It is measurable in inter-annotator agreement scores.

This finding sharpens "Why do LLMs accept logical fallacies more than humans?": the susceptibility is part of a larger comprehension deficit that is invisible from the persuasion side. The model that can be talked into wrong answers by fallacies is the same model that produces fallacy-ridden but persuasive output, and the same model that cannot tell whether the output it just produced is fallacy-ridden.

It also reframes "Do humans and LLMs differ fundamentally or just superficially?" The dissociation is most visible to a third party who can compare persuasion outcomes against comprehension outcomes; from inside the dialogue, the participant has no reliable signal that comprehension is absent.

For language-as-event writing, this is the empirical wedge separating successful argumentative behavior from genuine comprehension of pragmatic context, a distinction argumentation theory has long needed without being able to operationalize.

Inquiring lines that read this note 18

This note is a source for these research framings, grouped by the broader line of inquiry each explores.

How faithfully do LLMs reflect their actual reasoning in outputs and explanations?

Does conversational format create illusions of genuine AI communication?

How do audiences evaluate speech when there is no speaker to assess?

How do formal dialogue structures reveal conversation coherence mechanisms?

How does rhetorical adaptation affect LLM persuasion and detectability?

What makes AI persuasion effective and how can we counter it?

How do language models establish social grounding in human dialogue?

How do evaluation biases undermine LLM quality assessment systems?

Can LLM persuasion be fairly evaluated without stratifying by reader background?

Related concepts in this collection 2

Why do LLMs accept logical fallacies more than humans?
LLMs fall for persuasive but invalid arguments at much higher rates than humans. This explores whether reasoning models genuinely evaluate logic or simply mimic argument structure.
comprehension-side deficit that pairs with the persuasion-side asymmetry

Do humans and LLMs differ fundamentally or just superficially?
Explores whether the gap between human and AI cognition is categorical or contextual. Matters because it shapes how we design, evaluate, and interact with language models in practice.
dissociation is observer-detectable, participant-invisible

Original note title

LLM persuasive success is dissociable from comprehension of argument structure — fluent persuasion is a separable capability from understanding what is being argued
