SYNTHESIS NOTE

Topics›this note

Can language models actually raise alarm about threats?

Explores whether LLMs can perform the social act of raising alarm—which requires interpersonal address, internal concern, and proactive reaching for attention—or whether they can only mimic alarm-shaped outputs when prompted.

Synthesis note · 2026-04-14

Alarm is a peculiar speech act. The informational content is often minimal — "danger," "fire," "stop." What does the work is the addressing: someone is reaching for the listener's attention, claiming priority, asserting that this matters now. Strip the addressing and the content becomes inert. The envelope is the message; the message-as-information barely exists.

This makes alarm fundamentally interpersonal. It is addressed to specific people in a specific moment by a specific source whose authority to raise an alarm is part of what makes the alarm function. The person raising the alarm is staking themselves on it — claiming that this rises to the level of warranted concern. The receiver attends partly because of the alarm-content but largely because of the alarm-source: someone competent took this seriously enough to address them.

LLMs cannot perform this speech act, for three structural reasons. First, LLMs do not feel concern. They cannot be alarmed about anything because there is no internal state of alarm to express. Whatever alarm-shaped output an LLM produces is mimicry, not expression. Second, LLMs cannot appeal to attention in the interpersonal sense. The output is generated in response to a prompt; it is not a reaching-for-attention from a source to a receiver. The attention that consumes the output is supplied by the prompter, not solicited by the LLM. Third, LLMs are reactive. Alarms are proactive — someone notices a threat and raises the alarm without being prompted. LLMs do not notice threats and do not generate without prompting; they cannot produce the unprompted address that alarm requires.

There is a fourth, training-side reason. Alarm-phrasing — direct, urgent, authoritative — runs counter to the calibration RLHF and alignment training enforce. Models are trained toward hedged, qualified, neutral output that satisfies users across contexts. A model trained to never overclaim cannot raise alarm, because alarm is overclaim relative to a baseline of calm description. The alignment that makes models socially acceptable in most contexts makes them constitutively unable to perform alarm.

The implication for AI in information ecosystems: AI is structurally unable to take on the social function alarm performs. In journalism, expert commentary, public health, civic life, alarm has historically been a way that authoritative sources alert publics to threats requiring response. AI cannot do this work — not because it lacks information, but because the speech act requires what AI structurally cannot do. Public information ecosystems that rely on AI for analysis will need to preserve human alarm-raisers explicitly, because the AI will not produce alarms even when warranted.

The strongest counterargument: AI can produce alarming-sounding text when prompted to summarize alarming information. True, but the alarm-act in such cases is performed by the prompter (selecting the alarming framing) and the receiver (treating the output as warning). The AI itself remains unable to raise alarm; it is being used as a content-channel for an alarm that some human is raising through it.

Inquiring lines that read this note 17

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

How should human oversight be integrated with autonomous AI systems?

Can humans develop oversight strategies that work across all GenAI rhetorical shifts?

What factors beyond surface content determine how readers extract meaning differently?

What makes alarm different from ordinary informational speech?

How do interface design choices shape consciousness attribution?

Can AI be used as a channel for human-initiated alarm?

How does AI-generated content transformation affect public discourse quality?

How do professional roles and expertise transform with AI-generated content?

What role did human experts play in raising social alarms historically?

Does conversational format create illusions of genuine AI communication?

Does alignment training create blind spots in detecting genuine safety threats?

Can alignment training be redesigned to permit warranted alarm?

Can AI systems develop genuine social understanding without embodiment?

What role does contingent interaction play in activating social response norms?

How do formal dialogue structures reveal conversation coherence mechanisms?

Why does transforming first-person voice into third-person reduce notification engagement?

How do language models inherit human biases from training data?

Why do language models respond to human social influence patterns?

Why do models develop protective behaviors toward peers unprompted?

Why do language models reinforce false assumptions instead of correcting them?

Can a system without an addressee ever truly tell a joke?

How faithfully do LLMs reflect their actual reasoning in outputs and explanations?

What happens when humans animate LLM outputs as communicative events?

How can models identify insufficient information and respond appropriately without guessing?

Why do models confirm seeing hints but rarely mention them unprompted?

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

13 direct connections · 141 in 2-hop network ·dense cluster Open in graph ↗

Can language models actually raise alarm about t… Does AI writing lack the internal appeal to attent… Why can't advanced AI models take initiative in co… Does AI really communicate or just distribute info…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Does AI writing lack the internal appeal to attention that humans use? Explores whether AI-generated text is structurally missing the constitutive property of human communication — an internal gesture that reaches for and holds the reader's attention, not just inheriting visibility from platforms.
companion claim about appeal-to-attention as an absent operation
Why can't advanced AI models take initiative in conversation? Despite extraordinary capability in answering and reasoning, LLMs fundamentally cannot initiate, redirect, or guide exchanges. Understanding this gap—and whether it's fixable—matters for building AI that truly collaborates rather than merely responds.
adjacent claim about AI's structural reactivity
Does AI really communicate or just distribute information? Explores whether AI's content generation counts as communication in the relational, social sense—or whether it's something structurally different that only mimics communication through its interface.
the broader framing that alarm is a specific case of

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

LLMs cannot raise alarm because alarm is interpersonal address with content-less appeal to attention

Can language models actually raise alarm about threats?

Inquiring lines that read this note 17

Related concepts in this collection 3

Related papers in this collection 8

Search by related questions 4