Do language models and humans respond to word frequency the same way?

Both LLMs and humans show stronger responses to high-frequency words. This raises a puzzle: if models mirror human neural patterns, what actually makes them different from human language processing?

Synthesis note · 2026-05-02 · sourced from Natural Language Inference

Adam's Law's literature review surfaces an inconvenient symmetry. Desai et al. (2020) and Alexandrov et al. (2011) found that high-frequency words evoke stronger neural responses in human readers than low-frequency words during reading tasks. Heylen et al. (2008) found high-frequency target words have higher semantic similarity to nearest-neighbor words in distributional analyses — frequency drives perceived semantic similarity. Mohan and Weber (2019) document frequency effects on semantic retrieval. The frequency-comprehension link is not an LLM-specific artifact. Humans show it too, at the neural level.

This complicates the easy "LLMs are aliens" framing that often accompanies critiques like Do LLMs compress concepts more aggressively than humans do?. At the level of statistical exposure to text, models and human readers occupy the same regime: both privilege the frequent. The convergence is not coincidence; both systems are exposed to the same statistical structure of language — the shape of natural language is not neutral, and the shape leans on frequency. Word frequency is a property of the linguistic environment, not just a property of how LLMs process that environment.

But the symmetry is partial, and the asymmetry is what matters. Humans can override frequency through attention, context, and intention: a doctor reading a rare term in a clinical context can attend to it carefully despite its rarity; a poet can foreground low-frequency words deliberately. The override mechanism is what Why do dialogue failures persist despite scaling language models? indirectly identifies — humans are trained dialogically with goal-relevant attention shaping comprehension; LLMs are trained monologically with no equivalent override channel. The model cannot bracket frequency when frequency is irrelevant to the current goal because there is no current goal that can take priority over the statistical prior. The frequency response is the same across human and machine; the capacity to not be governed by it is what humans have and the architecture lacks. This refines the alien framing: the divergence is not in the response, it is in the override.

Inquiring lines that read this note 3

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

How do language models establish social grounding in human dialogue?

How do LLMs differ from humans in their grounding mechanisms?

Do language models learn genuine linguistic structure or just surface patterns?

Related concepts in this collection 2

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

13 direct connections · 120 in 2-hop network ·dense cluster Open in graph ↗

Do language models and humans respond to word fr… Do LLMs compress concepts more aggressively than h… Why do dialogue failures persist despite scaling l…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Do language models and humans respond to word frequency the same way?

Inquiring lines that read this note 3

Related concepts in this collection 2

Related papers in this collection 8

Search by related questions 4