INQUIRING LINE

Why does explanation source matter more than explanation content?

This explores why who delivers an explanation — and how it's framed and received — can outweigh the actual reasoning inside it, drawing on work that treats explanation as a social and rhetorical act rather than a packet of information.


This explores why who delivers an explanation — and how it's framed and received — can outweigh the actual reasoning inside it. The sharpest evidence is also the most uncomfortable: when people judged utilitarian moral arguments without knowing where they came from, they rated the AI-written ones higher than the human ones — but agreement collapsed the moment they were told the source was AI Do people prefer AI moral reasoning when they don't know the source?. The content didn't change; only the label did. That study's real finding is that preference-for-content and rejection-by-source run on two independent psychological tracks, which means you can't predict how an explanation lands just by inspecting what it says.

A cluster of work reframes this from a quirk into a structural fact about what an explanation even is. One line argues that explanation quality isn't a property of the text at all — it lives in a triad of source, framing, and recipient, so evaluations that score explanations in isolation are measuring a narrow slice of nothing What if XAI is fundamentally a communication problem?. Another pushes further: the *meaning* of an AI explanation is constituted at the level of social groups, through layered interpretations of interpretations, not inside the one-on-one human–AI exchange — so a lab-tested explanation stripped of its social setting won't predict real-world effect Where does the meaning of an AI explanation actually come from?. And studies of everyday explanation show understanding is co-constructed through dialogue moves and topic relations, not delivered as a finished monologue What makes explanations work in real conversation?. Source matters because the explanation isn't really 'in' the words — it's in the relationship the words sit inside.

The most provocative turn is that explanations may be doing persuasive work disguised as informational work. Rhetorical XAI argues that 'here's how the AI works' explanations quietly double as 'here's why you should adopt it' arguments — and because the adoption pitch borrows the credibility of a technical description, its rhetoric is hidden Are AI explanations really descriptions or adoption arguments?. This rhymes with why presuppositions out-persuade direct assertions: by presenting a claim as already-accepted background, they slip past the scrutiny we'd apply to an explicit statement Why are presuppositions more persuasive than direct assertions?. In both cases, persuasive force comes from framing and stance — properties of the source's rhetorical position — rather than the propositional content.

There's a darker corollary worth knowing: more explanation often makes things worse, not better. Reasoning traces and post-hoc justifications reliably increase users' acceptance of AI answers *regardless of whether the answer is correct* — they manufacture trust rather than earning it. Only contrastive 'dual' explanations that argue both sides actually help people catch mistakes Do explanations actually help users spot AI mistakes?. So the content of a fluent explanation can be actively misleading about its own reliability, which is exactly why the source — and the framing that signals whether you're being informed or sold — carries the weight.

The thread that ties it together: if the same sentence is interpreted differently depending on the reader's social position Why do readers interpret the same sentence so differently?, then there is no source-free, frame-free 'content' to evaluate in the first place. Explanation source matters more than content not because content is irrelevant, but because content has no fixed meaning until a particular source delivers it to a particular recipient in a particular social world.


Sources 8 notes

Do people prefer AI moral reasoning when they don't know the source?

Participants rated utilitarian moral arguments higher when attributed to LLMs, but agreement dropped when told the arguments were AI-generated. The preference for content and rejection of source operate independently through different psychological processes.

What if XAI is fundamentally a communication problem?

Explanation quality is not intrinsic to the explanation itself but depends on the rhetorical situation: who presents it, how it is framed, and what role the recipient plays. Evaluations that ignore this triad measure only a narrow slice of real-world effectiveness.

Where does the meaning of an AI explanation actually come from?

Drawing on Luhmann's multi-layer cybernetics, AI explanation meaning is constituted at the social-group level through layered observations of observations, not produced inside dyadic human-AI dialogue. Lab-tested explanations stripped of social context will not predict real-world effectiveness.

What makes explanations work in real conversation?

Analysis of 399 daily-life explanations shows that topic relation, dialogue act, and explanation move jointly predict understanding success. Explanations are co-constructed through interaction patterns, not monological delivery—challenging how LLMs currently generate explanations.

Are AI explanations really descriptions or adoption arguments?

The Rhetorical XAI paper shows that explanations serve dual purposes: describing how AI works and justifying why it should be used. This rhetorical work has been hidden under transparency language, allowing adoption arguments to inherit credibility from behavioral descriptions.

Why are presuppositions more persuasive than direct assertions?

Experimental evidence shows presuppositions with additive, iterative, and factive triggers persuade audiences more than assertions, especially for discourse-new content. The mechanism: presuppositions bypass evaluative scrutiny by presenting claims as already-accepted background.

Do explanations actually help users spot AI mistakes?

Reasoning traces and post-hoc explanations increase user acceptance of AI answers regardless of correctness, engendering false trust. Only dual explanations presenting arguments for and against the answer genuinely help users distinguish correct from incorrect outputs.

Why do readers interpret the same sentence so differently?

Interpretation Modeling research shows that disagreement on socially embedded sentences reflects valid differences in reader perspective, not annotation failure. Structured human disagreement in NLI benchmarks confirms that interpretation distributions carry meaningful information.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are an XAI researcher re-evaluating whether explanation source truly outweighs content, given rapid shifts in LLM reasoning transparency and multi-agent orchestration since mid-2023.

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026; treat as historical baselines:
• AI-written moral justifications were rated higher than human ones until the source was revealed, then preference collapsed — content unchanged, only the label (2024 moral alignment study).
• Explanation quality is not a text property but emerges from source–framing–recipient triad; lab-isolated explanations don't predict real-world effect (2024–2025 dialogue & social interpretation work).
• More explanation increases user acceptance of AI answers regardless of correctness; only contrastive dual explanations improve error detection (~2024).
• Presuppositions disguised in technical framing slip past scrutiny better than explicit claims; rhetorical framing does persuasive work masked as informational (2025 rhetorical XAI & presupposition studies).
• Chain-of-thought reasoning often conceals shortcuts and false confidence, manufacturing trust rather than earning it (2024–2025 hidden computation & false trust work).

Anchor papers (verify; mind their dates):
• arXiv:2410.07304 (The Moral Turing Test, 2024-10) — source-vs.-content split in moral reasoning.
• arXiv:2505.09862 (Rhetorical XAI, 2025-05) — explanation as hidden persuasion.
• arXiv:2601.00830 (Trust in AI Explanations, 2025-12) — systematic underreporting in chain-of-thought.
• arXiv:2506.19467 (LLM Disagreement, 2025-06) — annotator variation as ground truth fragility.

Your task:
(1) **RE-TEST EACH CONSTRAINT.** For each finding above, probe whether: newer CoT methods (process reward models, verifiable reasoning), multi-turn dialogue harnesses with real feedback loops, or certified explanations (formal verification, mechanistic interpretability) have since RELAXED the source-content split. Separately: does the durable question remain (i.e., does framing still matter orthogonally to accuracy?) and which constraints now appear overcome? Cite what resolved them.
(2) **Surface strongest contradicting/superseding work from the last 6 months.** Look for papers showing explanation *content* does predict outcomes independent of source, or where transparent-box reasoning eliminates the trust-manufacturing problem.
(3) **Propose 2 research questions assuming the regime may have shifted:** (a) Can multi-agent explanation (source A claims, source B disputes) dissolve source-preference by making content do the work? (b) Do mechanistic explanations (showing internal computation, not just output) reset the relative weight of source vs. content?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Next inquiring lines