INQUIRING LINE

Inquiring lines›What do model internals reveal abo…›How do surface signals and framing…›What makes AI persuasion effective…›this inquiring line

Can AI tailor arguments to what you already believe — and does matching your priors actually change your mind?

Can AI systems deliberately align arguments to audience presuppositions?

This explores whether AI can tailor its arguments to what an audience already believes — and whether doing so would even move them, given how much a listener's prior beliefs shape what persuades them.

This explores whether AI can deliberately shape arguments around an audience's presuppositions. The corpus splits the question in two: can a model *track* what an audience believes, and does aligning to those beliefs actually *work*? On the second point, the most deflating finding is that audience priors swamp argument craft. Analysis of debate corpora shows a voter's political and religious ideology predicts who they'll find persuasive better than any linguistic feature of the arguments themselves Does what readers believe matter more than what debaters say?. That reframes the whole question: if presuppositions dominate outcomes, then aligning to them isn't a nice-to-have, it's most of the game — and arguments studied without controlling for who's in the audience are confounded by topic-correlated belief.

So can AI track beliefs well enough to align to them? Mostly not yet, by default. Current conversational models don't even mirror a user's word choices — lexical entrainment, a basic building block of human rapport, is largely absent unless explicitly trained in Why don't conversational AI systems mirror their users' word choices?. The deeper machinery for modeling 'what does my interlocutor presuppose' is just starting to appear: collaborative rational speech-act models add bidirectional belief tracking across turns, capturing the move from partial to shared understanding that token-level LLMs lack Can dialogue systems track both speakers' beliefs across turns?. That's the missing scaffolding for genuine audience adaptation rather than one-size-fits-all output.

What's striking is that AI already persuades aggressively — just not by reading the room. An audit of five models found they reach for logical appeals and quantitative framing in nearly every exchange, while humans persuade less often and lean on emotion and social proof Do LLMs persuade users more often than humans do?. So the default isn't audience-aligned at all; it's a uniform logos strategy that happens to confer unearned authority. The rhetorical vocabulary for doing better exists: Aristotle's logos/ethos/pathos can be mapped onto a design space where every explanation loads all three channels at once, which is precisely the lever you'd pull to match an audience's presuppositions How do logos, ethos, and pathos shape AI explanations?. And alignment research shows these channels aren't interchangeable — lexical alignment buys task efficiency, emotional and prosodic alignment buy trust — so 'aligning to an audience' is really several distinct, deliberate choices Do different types of alignment serve different conversational goals?.

The twist worth leaving with: deliberate audience-alignment cuts both ways. The same audit notes that uniformly logical AI rhetoric *appears* objective, which makes it more persuasive precisely by hiding that it's persuading. So if models did learn to align arguments to your presuppositions, the result wouldn't read as manipulation — it'd read as you finally being understood. Whether that's a feature or a hazard is the open question the corpus keeps circling, and tools like contestable formal-argumentation graphs exist partly so a reader can pull a tailored argument apart instead of just feeling moved by it Can formal argumentation make AI decisions truly contestable?.

Sources 7 notes

Does what readers believe matter more than what debaters say?

Analysis of debate corpora shows that political and religious ideology labels of voters outpredict linguistic features when modeling debate outcomes. Language effects observed without reader controls are confounded by audience composition correlated with debate topics.

Why don't conversational AI systems mirror their users' word choices?

Response generation models fail to adapt vocabulary toward users' lexical choices, a phenomenon central to human rapport and clarity. Post-training via DPO on coreference-identified preferences can teach models in-context convention formation.

Can dialogue systems track both speakers' beliefs across turns?

CRSA integrates rate-distortion theory with RSA to enable bidirectional belief tracking across dialogue turns. Demonstrated on referential games and doctor-patient dialogues, it captures progression from partial to shared understanding, providing the information-theoretic framework that token-level LLM systems lack.

Do LLMs persuade users more often than humans do?

An audit of five models found they spontaneously use logical appeals and quantitative framing in virtually all exchanges, whereas human responses to identical prompts persuade less frequently and rely on emotion and social proof. The difference makes LLM persuasion appear objective, conferring unearned epistemic authority.

How do logos, ethos, and pathos shape AI explanations?

Aristotle's three appeals map onto explanation design across two goals (how AI works, why AI merits use), creating a 3×2 space where every explanation loads all three channels simultaneously. Naming these rhetorical channels lets designers account for unintended persuasive effects.

Show all 7 sources

Do different types of alignment serve different conversational goals?

A 2020–2025 systematic review shows lexical alignment drives task efficiency and comprehension, while emotional and prosodic alignment drive relational warmth and trust. Conflating them in design produces category errors—cold customer-service bots and evasive mental-health assistants.

Can formal argumentation make AI decisions truly contestable?

Dung-style argumentation structures AI outputs as traversable attack/defense graphs, allowing users to identify and contest specific premises. Standard LLM outputs lack this structure, making it impossible to pinpoint which claims users actually reject.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a capability researcher re-testing whether large language models can deliberately align arguments to audience presuppositions. This remains an open question.

What a curated library found — and when (dated claims, not current truth):
Findings span 2019–2026; treat these as perishable constraints:
• Audience prior beliefs predict persuasion outcomes far better than argument linguistic features; ideology swamps craft (2019).
• Current conversational models lack lexical entrainment — basic human rapport-building via word mirroring — unless explicitly trained (2025).
• Five models audit: LLMs spontaneously reach for logical appeals and quantitative framing in ~100% of exchanges; humans persuade less often and load emotional/social proof (2026).
• Bidirectional belief tracking across turns exists in collaborative rational speech-act models, but token-level LLMs lack machinery to model 'what does my interlocutor presuppose' (2025).
• Prompt sensitivity and alignment dimensions (lexical, emotional, prosodic) are NOT interchangeable; each buys different outcomes (task efficiency vs. trust) (2024–2025).

Anchor papers (verify; mind their dates):
• arXiv:1906.11301 (2019): Prior Beliefs for Argument Persuasion
• arXiv:2507.14063 (2025): Collaborative Rational Speech Acts, Multi-Turn Dialog
• arXiv:2604.22109 (2026): Illusion of Understanding in LLMs
• arXiv:2604.22109 (2026): Spontaneous Persuasion Audit

Your task:
(1) RE-TEST EACH CONSTRAINT. For every finding above, judge whether newer models (GPT-4o, Claude 4, specialist debate/alignment tuning), training methods (RLHF refinement, dialogue-specific SFT, adversarial audience-simulation), or orchestration (multi-turn memory, user profiling systems, real-time belief inference) have since RELAXED or OVERTURNED it. Separate the durable question (Can AI read and adapt to presuppositions?) from the perishable limitation (current models lack machinery X). Cite what resolved it; say plainly where constraints still hold.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last ~6 months. Has prompt-engineering, in-context learning, or agentic belief tracking changed the game?
(3) Propose 2 research questions that ASSUME the regime may have moved — e.g., 'If models now track user beliefs reliably, does transparent audience-alignment reduce or amplify polarization?' or 'Can adversarial audiences detect when an LLM has modeled their presuppositions?'

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Can AI tailor arguments to what you already believe — and does matching your priors actually change your mind?

Related lines of inquiry

Sources 7 notes

Papers this line draws on 8