SYNTHESIS NOTE
Language, Text, and Discourse

Do linguistic features of persuasion stay the same across audiences?

When researchers study what language makes arguments persuasive, do they account for who is listening? Without controlling for reader beliefs, do findings about persuasive language actually reflect audience effects instead?

Synthesis note · 2026-05-18 · sourced from Argumentation
Where exactly do LLMs break down with language structure? How do people build trust with conversational AI?

The same debate corpus produces two different stories about which linguistic features drive persuasion, depending on whether reader-level controls are included. Without controls — the standard NLP setup — one set of linguistic features emerges as predictive. With political and religious ideology controls — the controlled setup — a different set emerges. The features themselves are not stable across specifications.

This is a stronger result than "reader factors also matter." It says the standard specification produces a biased picture of which language features cause persuasion. Some features that appear predictive without controls are proxies for audience-text matching; their predictive power evaporates once you account for who is in the audience. Other features only emerge as predictive once audience composition is held constant — they are real but hidden by the noise that audience heterogeneity introduces.

The methodological consequence is that the language-of-persuasion literature needs a re-read. Many findings about which words, which moves, which features make arguments more persuasive were estimated on debate corpora without reader controls. Some of those findings are likely artifacts of audience composition rather than language effects. Replicating them under reader-level controls is the cheap empirical correction.

The best-performing model in this study combines reader features and linguistic features — neither alone suffices. This is the operational conclusion: language matters, audience matters, and they interact. Modeling either in isolation misses the joint structure.

For LLM persuasion evaluation specifically, the design implication is to stratify by reader ideology when measuring stance shift. Aggregate numbers conflate ideology-congruent and ideology-opposed effects in a way that obscures the actual mechanism.

Inquiring lines that use this note as a source 13

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
12 direct connections · 72 in 2-hop network ·medium cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

the most-predictive linguistic features of persuasion shift once reader prior beliefs are controlled — NLP studies of persuasion that omit reader-level factors are confounded