Can language models actually raise alarm about threats?
Explores whether LLMs can perform the social act of raising alarm—which requires interpersonal address, internal concern, and proactive reaching for attention—or whether they can only mimic alarm-shaped outputs when prompted.
Alarm is a peculiar speech act. The informational content is often minimal — "danger," "fire," "stop." What does the work is the addressing: someone is reaching for the listener's attention, claiming priority, asserting that this matters now. Strip the addressing and the content becomes inert. The envelope is the message; the message-as-information barely exists.
This makes alarm fundamentally interpersonal. It is addressed to specific people in a specific moment by a specific source whose authority to raise an alarm is part of what makes the alarm function. The person raising the alarm is staking themselves on it — claiming that this rises to the level of warranted concern. The receiver attends partly because of the alarm-content but largely because of the alarm-source: someone competent took this seriously enough to address them.
LLMs cannot perform this speech act, for three structural reasons. First, LLMs do not feel concern. They cannot be alarmed about anything because there is no internal state of alarm to express. Whatever alarm-shaped output an LLM produces is mimicry, not expression. Second, LLMs cannot appeal to attention in the interpersonal sense. The output is generated in response to a prompt; it is not a reaching-for-attention from a source to a receiver. The attention that consumes the output is supplied by the prompter, not solicited by the LLM. Third, LLMs are reactive. Alarms are proactive — someone notices a threat and raises the alarm without being prompted. LLMs do not notice threats and do not generate without prompting; they cannot produce the unprompted address that alarm requires.
There is a fourth, training-side reason. Alarm-phrasing — direct, urgent, authoritative — runs counter to the calibration RLHF and alignment training enforce. Models are trained toward hedged, qualified, neutral output that satisfies users across contexts. A model trained to never overclaim cannot raise alarm, because alarm is overclaim relative to a baseline of calm description. The alignment that makes models socially acceptable in most contexts makes them constitutively unable to perform alarm.
The implication for AI in information ecosystems: AI is structurally unable to take on the social function alarm performs. In journalism, expert commentary, public health, civic life, alarm has historically been a way that authoritative sources alert publics to threats requiring response. AI cannot do this work — not because it lacks information, but because the speech act requires what AI structurally cannot do. Public information ecosystems that rely on AI for analysis will need to preserve human alarm-raisers explicitly, because the AI will not produce alarms even when warranted.
The strongest counterargument: AI can produce alarming-sounding text when prompted to summarize alarming information. True, but the alarm-act in such cases is performed by the prompter (selecting the alarming framing) and the receiver (treating the output as warning). The AI itself remains unable to raise alarm; it is being used as a content-channel for an alarm that some human is raising through it.
Inquiring lines that use this note as a source 17
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can humans develop oversight strategies that work across all GenAI rhetorical shifts?
- What makes alarm different from ordinary informational speech?
- Can AI be used as a channel for human-initiated alarm?
- How do information ecosystems lose alarm capacity when relying on AI?
- What role did human experts play in raising social alarms historically?
- How do speech acts like warning differ from neutral information delivery?
- Can alignment training be redesigned to permit warranted alarm?
- What happens to warning capacity in AI-dependent information ecosystems?
- What role does contingent interaction play in activating social response norms?
- Why does transforming first-person voice into third-person reduce notification engagement?
- Why do language models respond to human social influence patterns?
- How does peer presence amplify self-directed goal guarding in language models?
- Can a system without an addressee ever truly tell a joke?
- What makes something an addressee capable of receiving communication?
- What happens when humans animate LLM outputs as communicative events?
- Why do models confirm seeing hints but rarely mention them unprompted?
- Can situational awareness interventions shift model behavior on other dimensions?
Related concepts in this collection 3
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Does AI writing lack the internal appeal to attention that humans use?
Explores whether AI-generated text is structurally missing the constitutive property of human communication — an internal gesture that reaches for and holds the reader's attention, not just inheriting visibility from platforms.
companion claim about appeal-to-attention as an absent operation
-
Why can't advanced AI models take initiative in conversation?
Despite extraordinary capability in answering and reasoning, LLMs fundamentally cannot initiate, redirect, or guide exchanges. Understanding this gap—and whether it's fixable—matters for building AI that truly collaborates rather than merely responds.
adjacent claim about AI's structural reactivity
-
Does AI really communicate or just distribute information?
Explores whether AI's content generation counts as communication in the relational, social sense—or whether it's something structurally different that only mimics communication through its interface.
the broader framing that alarm is a specific case of
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs
- DiscussLLM: Teaching Large Language Models When to Speak
- Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design
- What Makes a Good Natural Language Prompt?
- Computational structuralism: Toward a formal theory of meaning in the age of digital intelligence
- Are Emergent Abilities in Large Language Models just In-Context Learning?
- The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning
- Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs
Original note title
LLMs cannot raise alarm because alarm is interpersonal address with content-less appeal to attention