Do language models overestimate how often irony appears?

This explores whether LLMs systematically misread ironic intent in text, assigning higher irony scores than humans do. The gap suggests models learn irony patterns from training data without understanding their actual frequency in real communication.

Synthesis note · 2026-03-26

GPT-4o can interpret ironic intent in emoji usage. But it systematically overestimates ironic intent compared to humans — the median irony score assigned by GPT-4o is significantly higher than human perception (p < .001). LLMs detect irony as a category but miscalibrate its prevalence (Irony in Emojis: A Comparative Study of Human and LLM Interpretation).

This overestimation reveals something important about how LLMs process pragmatic meaning. Irony detection is a pattern-matching success: the model has learned which textual features correlate with ironic intent in its training data. But ironic patterns are over-represented in training data relative to their actual frequency in human communication, because ironic usage is more salient, more commented upon, more explicitly labeled than sincere usage. The model learns the pattern but not the base rate.

This is a specific instance of a broader calibration problem. Since Why do preference models favor surface features over substance?, we know that training data artifacts systematically distort model judgments across multiple dimensions. Irony overestimation is the pragmatic version: the model's sense of "how often is this ironic?" is calibrated to training data saliency, not to real-world frequency.

The implication for literary analysis is significant. Literary irony is subtle, context-dependent, and often operates through understatement — exactly the opposite of the salient, explicitly marked irony that dominates training data. A model that over-reads ironic intent will find irony where an author intended none, and may miss genuine irony that operates through restraint rather than exaggeration. Since Can language models adapt implicature to conversational context?, the failure to calibrate irony to context is part of a larger pattern: LLMs apply fixed pragmatic templates where communicative context should modulate interpretation.

Inquiring lines that read this note 23

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

Why can't humans reliably detect AI-generated text despite measurable linguistic signatures?

Why do language models reinforce false assumptions instead of correcting them?

How faithfully do LLMs reflect their actual reasoning in outputs and explanations?

What happens when LLMs analyze literary irony that relies on understatement?

Why do language models struggle with implicit discourse relations?

What percentage of natural language relies on plausible deniability through ambiguous phrasing?

How do language models inherit human biases from training data?

How do training priors constrain what context information can override?

Can implicit linguistic information ever be reliably learned from training data?

How does latent reasoning compare to verbalized chain-of-thought?

How do discourse-level patterns reveal cognitive distortions better than individual statements?

How can emotions function as reliable information in reasoning and cognitive systems?

Can language models understand the implicit emotional intent behind questions?

Why should disagreement be treated as signal in collaborative reasoning?

Can discourse communities collectively detect disruptions individual readers miss?

What mechanisms enable AI systems to generate and spread false beliefs?

Can AI systems detect deception by monitoring real-time linguistic style matching patterns?

What factors beyond surface content determine how readers extract meaning differently?

Does AI fluency substitute for verifiable accuracy in human judgment?

How does this pattern match false punditry in AI commentary?

How can language models sustain linguistic synchrony and intersubjectivity during dialogue?

Can AI models predict whether alignment reads as warmth versus mockery in different cultures?

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

13 direct connections · 139 in 2-hop network ·dense cluster Open in graph ↗

Do language models overestimate how often irony … Why do preference models favor surface features ov… Can language models adapt implicature to conversat… Why do speakers deliberately use ambiguous languag…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Why do preference models favor surface features over substance? Preference models show systematic bias toward length, structure, jargon, sycophancy, and vagueness—features humans actively dislike. Understanding this 40% divergence reveals whether it stems from training data artifacts or architectural constraints.
calibration bias from training data saliency
Can language models adapt implicature to conversational context? Do large language models flexibly modulate scalar implicatures based on information structure, face-threatening situations, and explicit instructions—as humans do? This tests whether pragmatic computation is truly context-sensitive or merely literal.
fixed pragmatic templates where context should modulate
Why do speakers deliberately use ambiguous language? Explores whether ambiguity is a linguistic defect or a strategic tool speakers use for efficiency, politeness, and deniability. Matters because it challenges how we train language systems.
irony operates through productive ambiguity between literal and intended meaning

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

LLM irony detection systematically overestimates ironic intent — calibration bias reveals pattern recognition without pragmatic understanding

Do language models overestimate how often irony appears?

Inquiring lines that read this note 23

Related concepts in this collection 3

Related papers in this collection 8

Search by related questions 4