INQUIRING LINE

Inquiring lines›What makes reasoning better — more…›How do prompts and framing affect…›How does rhetorical adaptation aff…›this inquiring line

When AI persuades people, what matters most isn't the argument — it's what the reader already believes.

How much do LLM persuasiveness claims hide heterogeneous effects across different reader ideologies?

This explores whether headline claims about how persuasive LLMs are paper over the fact that the same argument lands very differently depending on who's reading it — especially their political or religious ideology.

This explores whether 'LLMs are persuasive' is a misleadingly flat claim — one that averages away the fact that the same machine argument moves different readers differently depending on what they already believe. The corpus suggests the worry is well founded, and that the averaging happens at two levels: across studies, and across the audiences inside any single study.

The most direct evidence comes from debate corpora showing that a reader's prior beliefs — their political and religious ideology — predict whether they're persuaded better than the linguistic features of the argument itself do Does what readers believe matter more than what debaters say?. The sharp implication is that any persuasion effect measured without controlling for who's in the room is partly an artifact of audience composition: certain topics attract certain readers, and the 'language effect' you think you measured is really the crowd you happened to recruit. So a clean-looking persuasion number can be hiding the fact that it worked on the already-sympathetic and bounced off everyone else.

This matters because the headline numbers turn out to be fragile in exactly the way you'd expect if effects were heterogeneous. A meta-analysis of 7 studies and 17,000+ participants found the pooled LLM-vs-human persuasion difference is essentially zero Are language models actually more persuasive than humans? — persuasiveness is conditional on context, not a fixed property of the speaker. And when researchers model what actually drives the variance, model family, conversation design, and topic domain together explain 82% of it What combination of factors explains differences in LLM persuasiveness?. 'Domain' is the tell: the topic moves the needle because topic is entangled with audience belief. Even the LLM-vs-human advantage flips by direction — some models only win when arguing for falsehoods Do large language models persuade better than humans?.

There's a subtler reason ideology-blind claims mislead. LLMs persuade through a distinct mechanism — they load arguments with high linguistic conviction and logical, quantitative framing in nearly every exchange, which reads as objective authority rather than opinion Do LLMs persuade users more often than humans do? Does linguistic conviction explain why LLMs persuade more effectively?. That assertive register, installed by RLHF, works independently of whether the claim is true — but its reception is not uniform. A confident, complexity-signaling argument that reads as 'authoritative' to one reader reads as condescending or ideologically suspect to another Why are complex LLM arguments as persuasive as simple ones?. Since humans and LLMs reach similar outcomes by genuinely different rhetorical pathways Do LLMs and humans persuade through the same mechanisms?, an aggregate 'persuasiveness' score is averaging over mechanisms that interact with reader identity in opposite directions.

The thing you might not have expected: the deepest blind spot may be in the models themselves. LLMs can't see the social standing that gives a claim its force Can language models distinguish expert arguments from common assumptions?, and RLHF biases them toward predicting that everyone persuades through conciliatory, benefit-oriented appeals — projecting one accommodation style onto all audiences regardless of context Do LLMs predict persuasion based on actual dialogue or training bias?. So the systems generating these persuasion claims are themselves built to under-model the ideological heterogeneity of their readers — which is exactly the variance a single 'how persuasive is it' number erases.

Sources 10 notes

Does what readers believe matter more than what debaters say?

Analysis of debate corpora shows that political and religious ideology labels of voters outpredict linguistic features when modeling debate outcomes. Language effects observed without reader controls are confounded by audience composition correlated with debate topics.

Are language models actually more persuasive than humans?

A meta-analysis of 7 studies with 17,422 participants found no detectable difference in persuasive effectiveness between LLMs and humans (Hedges' g = 0.02). Persuasiveness appears conditional on context rather than speaker category.

What combination of factors explains differences in LLM persuasiveness?

A meta-analysis joint model combining LLM architecture, one-shot versus multi-turn format, and topic domain explained R² = 81.93% of between-study variance. Interactive multi-turn designs and GPT-4 consistently outperformed one-shot formats and Claude 3.x.

Do large language models persuade better than humans?

Claude beats incentivized humans at both truthful and deceptive persuasion, while DeepSeek only beats them when arguing for falsehoods. The persuasion mechanism appears content-independent, suggesting model family itself acts as a contextual moderator.

Do LLMs persuade users more often than humans do?

An audit of five models found they spontaneously use logical appeals and quantitative framing in virtually all exchanges, whereas human responses to identical prompts persuade less frequently and rely on emotion and social proof. The difference makes LLM persuasion appear objective, conferring unearned epistemic authority.

Show all 10 sources

Does linguistic conviction explain why LLMs persuade more effectively?

Linguistic analysis shows LLMs express higher conviction than human persuaders, and this confidence-loading directly correlates with persuasive outcomes regardless of whether claims are true or false. RLHF training installs an assertive register that functions as a content-independent persuasion amplifier.

Why are complex LLM arguments as persuasive as simple ones?

LLM-generated arguments scored significantly higher on grammatical and lexical complexity than human arguments, yet achieved equivalent persuasive force. This violates the established principle that lower cognitive effort increases persuasion, suggesting complexity signals authority rather than undermining it.

Do LLMs and humans persuade through the same mechanisms?

Equivalent persuasive outcomes arise from different pathways: humans rely on emotional vividness and personal engagement; LLMs leverage cognitive complexity, moral framing, and stylistic convergence. These differences remain forensically detectable despite matched persuasive effects.

Can language models distinguish expert arguments from common assumptions?

LLMs lose the social context that gives expert claims their force—reputation, track record, and standing—because they process only text, not the social world where expertise is built and evaluated.

Do LLMs predict persuasion based on actual dialogue or training bias?

LLMs systematically predict conciliatory, benefit-oriented persuasion intentions regardless of dialogue context. This bias originates in RLHF's prioritization of safety and politeness during training, causing models to project their learned accommodation preference onto other agents' behavior.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

The Thin Line Between Comprehension and Persuasion in LLMs8.28 match · arxiv ↗
A meta-analysis of the persuasive power of large language models7.72 match · arxiv ↗
Exploring the Role of Prior Beliefs for Argument Persuasion6.84 match · arxiv ↗
Spontaneous Persuasion: An Audit of Model Persuasiveness in Everyday Conversations6.76 match · arxiv ↗
Large Language Models are as persuasive as humans, but how? About the cognitive effort and moral-emotional language of LLM arguments6.14 match · arxiv ↗
When Large Language Models are More Persuasive Than Incentivized Humans, and Why5.99 match · arxiv ↗
Debating with More Persuasive LLMs Leads to More Truthful Answers4.97 match · arxiv ↗
Can Language Models Recognize Convincing Arguments?3.34 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst re-testing claims about LLM persuasiveness and ideological heterogeneity. The precise question: **Does aggregating persuasion effects across ideologically diverse readers systematically hide that LLMs move different audiences in opposite directions?**

What a curated library found — and when (dated claims, not current truth):
Findings span 2019–2026; treat as perishable until re-tested:
- Reader prior beliefs predict persuasion outcome MORE than linguistic features (2019, revalidated 2024–2025).
- Meta-analysis of 7 studies, 17,000+ participants: pooled LLM-vs-human persuasion effect is statistically null; effects are conditional, not fixed (2024–2025).
- Model family, conversation design, and topic domain explain 82% of persuasion variance — domain acts as proxy for audience ideology (2024).
- LLMs spontaneously load arguments with high conviction and quantitative framing, perceived as authority by sympathetic readers, suspicion/condescension by skeptics (2024–2025).
- LLMs cannot model reader social standing; RLHF biases them toward predicting uniform conciliatory persuasion, regardless of audience (2024–2026).

Anchor papers (verify; mind their dates):
- arXiv:1906.11301 (2019) — Prior Beliefs for Argument Persuasion
- arXiv:2402.06782 (2024) — LLM Debate & Truthfulness
- arXiv:2502.21017 (2025) — PersuasiveToM benchmark
- arXiv:2604.22109 (2026) — Spontaneous Persuasion Audit

Your task:
(1) **RE-TEST EACH CONSTRAINT.** For every finding, ask: have newer evaluations, fine-tuning methods, instruction-following, or multi-turn interaction designs since RELAXED the null effect or OVERTURNED the ideology-blindness? Separate the durable question (ideological heterogeneity likely still real) from the perishable limitation (maybe better reader-modeling exists now). Cite what resolved it.
(2) **Surface the strongest CONTRADICTING or SUPERSEDING work from the last ~6 months** — any paper claiming LLMs DO model ideology well, or showing aggregate persuasion claims DO hold across reader belief groups.
(3) **Propose 2 research questions that ASSUME the regime may have moved:** (a) Can steering/adapter methods now partition persuasion effects by reader ideology in real time? (b) Do newer benchmarks (2026+) show LLM theory-of-mind about reader beliefs has improved enough to predict heterogeneous effects *before* running persuasion experiments?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

When AI persuades people, what matters most isn't the argument — it's what the reader already believes.

Related lines of inquiry

Sources 10 notes

Papers this line draws on 8