Does AI persuasiveness fade across repeated conversations with the same person?
Does the persuasive edge LLMs show in initial encounters hold up over time? Understanding whether and why AI persuasion decays with exposure matters for assessing manipulation risk across different interaction lengths.
In Schoenegger's repeated-rounds design, the persuasive edge enjoyed by Claude 3.5 Sonnet and DeepSeek v3 over incentivized humans eroded over time, while human persuaders' effectiveness held steady. This is the inverse of a habituation curve in human-to-human persuasion, where rapport often increases persuasive efficacy across exposures. With LLMs, the more turns a persuadee spends with the model, the less it sways them.
Two interpretations are compatible with the data, and they have different design consequences. One is mechanism-noticing: with more exposure, persuadees pick up on stylistic tells (the conviction-loading documented elsewhere in the same paper, the formulaic argument structures) and discount them. The other is content-thinness: the model has a finite repertoire of moves on a given question, and once a persuadee has seen them, additional iterations add no new persuasive material. The first explanation predicts decay even on novel topics; the second predicts decay primarily on repeated topics. The published results do not yet adjudicate.
Either way, the operational implication is sharp. AI persuasion is most dangerous in single-encounter contexts: one-shot political ads, cold marketing, first reads of a generated article, single-pass content moderation messages. Sustained interaction is partially self-correcting. This inverts a common assumption — that long conversations with AI are where manipulation lives — and locates the threat instead in low-engagement consumption.
This sharpens Where does AI's persuasive power actually come from?: the post-training levers that boost persuasiveness operate against a baseline that itself decays under exposure. So the asymmetry between LLM and human persuasion is largest at first contact and narrows from there.
For media-design writing, this lines up with an emerging picture: AI's distinctive persuasive footprint is in skim-and-scroll information environments, not in deliberative dialogue. The same finding constrains expected effects in long-running coaching or therapy contexts — early-session sway is real, mid-program sway less so.
Inquiring lines that use this note as a source 45
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can removing human labor from influence operations change how constrained these campaigns become?
- Can belief-specific counterevidence help people resist AI persuasion attempts?
- Could false social proof from AI posts crowd out authentic influencer engagement?
- Does reducing one conspiracy belief change overall conspiratorial worldview?
- Can you weaken communication without eliminating it altogether?
- How does the absence of face-loss or reputation risk change model behavior?
- Can observers detect when LLMs comprehend versus when they merely persuade?
- Does uncertainty quantification in model responses reduce persuasive impact on audiences?
- How does source attribution change the complexity-persuasion relationship?
- Does the type of validation trigger different persuasion strategies in GPT-4?
- How do ethos logos and pathos shape AI persuasion under scrutiny?
- Can content-side interventions reduce AI persuasion where disclosure labels fall short?
- Does personalization itself actually improve persuasion beyond post-training effects?
- How well can platforms detect AI-generated personalized persuasion attempts?
- What defenses exist against personality-based psychological targeting at scale?
- Does conversational back-and-forth increase persuasion more than single responses?
- Why does LLM persuasive advantage fade across multiple interactions with users?
- Should AI persuasiveness claims be tied to specific model architectures?
- How do prompt design and training choices shift persuasive outcomes measurably?
- Does persuasion work the same way for all personality types and contexts?
- Why does AI persuasiveness increase while factual accuracy systematically decreases?
- Can current AI safety defenses actually stop semantic-level persuasion attacks?
- What mitigation frameworks exist for managing AI persuasion capabilities?
- Does engagement with AI partners decay over time like chatbot relationships do?
- Can subliminal bias spread between agents at inference time?
- How does motivational stage determine which interventions actually work for users?
- Can LLMs adapt persuasion strategies when they cannot track the listener's state?
- How do different personalization levels affect persuasion system design and effectiveness?
- Can individual adaptation in persuasion systems enable more targeted manipulation?
- What drives AI persuasiveness, post-training or personalization mechanisms?
- Can you weaken communication without eliminating it entirely?
- Does training for persuasiveness harm a model's factual accuracy?
- Why do study results on AI persuasion vary so widely?
- Can post-training techniques create persuasive advantage where none existed?
- Does unconditional stylistic mirroring harm or help LLM persuasiveness?
- Why do people notice and discount AI persuasion tactics with longer exposure?
- Where is AI persuasion most dangerous if repeated contact reduces its effect?
- Does AI persuasiveness decay equally on novel topics versus repeated ones?
- How does post-training persuasion ability interact with exposure-based decay over time?
- Can lightweight linguistic features reliably detect AI-generated persuasive text?
- Can persuasion research measure language effects without confounding them with audience composition?
- What happens when AI validation triggers escalating persuasion instead of reflection?
- Can LLMs ever activate the peripheral route of persuasion?
- Why do aggregate persuasion metrics mask what actually changes minds?
- What capabilities do frontier AI models currently demonstrate in persuasion and misuse?
Related concepts in this collection 2
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Where does AI's persuasive power actually come from?
Explores which techniques make AI most persuasive—and whether the usual suspects like personalization and model size are actually the main drivers. Matters because it reshapes where to focus AI safety concerns.
post-training levers boost a baseline that itself decays
-
Do LLMs persuade users more often than humans do?
Explores whether large language models spontaneously deploy persuasive tactics in ordinary conversations at higher rates than humans, and through what mechanisms. This matters because invisible persuasion in advice-seeking contexts may undermine user autonomy.
extends: the always-on persuasive default this note documents is what decays over repeated exposure, the time dimension of the same disposition
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- The Levers of Political Persuasion with Conversational AI
- A meta-analysis of the persuasive power of large language models
- When Large Language Models are More Persuasive Than Incentivized Humans, and Why
- Exploring the Role of Prior Beliefs for Argument Persuasion
- The Thin Line Between Comprehension and Persuasion in LLMs
- Spontaneous Persuasion: An Audit of Model Persuasiveness in Everyday Conversations
- Large Language Models are as persuasive as humans, but how? About the cognitive effort and moral-emotional language of LLM arguments
- PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues
Original note title
LLM persuasiveness wanes over repeated interactions while human persuasiveness does not — persuasion has a time-of-exposure decay specific to AI