SYNTHESIS NOTE

Why do language models fail to use knowledge they possess?

Large language models contain relevant world knowledge but often fail to activate it without explicit cues. This explores whether the bottleneck lies in knowledge storage or in the inference process that decides what background facts apply.

Synthesis note · 2026-05-01 · sourced from Linguistics, NLP, NLU

The Heuristic Override Benchmark study includes an explicitness gradient: same problem, varying the salience of the cue that should activate the constraint. The result decisively locates the failure. Adding a single subtle hint — for example emphasizing the key object that must be present ("get my car washed") — raises accuracy +15.3 percentage points on average across 14 models, from 59.2 percent to 74.5 percent.

This is diagnostic. If models lacked the relevant world knowledge, no amount of surface emphasis would help. They have the knowledge. The problem is that the knowledge does not get activated unless the prompt cues it explicitly. The bottleneck is not in storage, retrieval capacity, or chain-of-thought depth. It is in the inferential step that decides which background facts are relevant to the current decision.

Goal-decomposition prompting tells the same story from another angle. Forcing the model to enumerate preconditions before answering — to ask "what must be true for walking to be the right choice here?" — recovers 6 to 9 percentage points. The intervention works because it converts the implicit constraint into a self-generated explicit hint. The model can do the reasoning when it is forced to enumerate; it cannot reliably trigger the enumeration on its own when a salient surface heuristic is competing for attention.

The implication for deployment is unsettling. Standard prompts do not activate the relevant knowledge. The fact that the knowledge is present in the model does not mean it will be brought to bear on the decision the user actually needs the model to make. Knowledge possession and knowledge activation are decoupled.

Inquiring lines that read this note 8

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

How do neural networks separate factual knowledge from reasoning abilities?

How does memorization interact with learning and generalization?

How much does memorization capacity limit a model's ability to learn new information?

Do language models learn genuine linguistic structure or just surface patterns?

How faithfully do LLMs reflect their actual reasoning in outputs and explanations?

How can a model explain something correctly yet fail to apply it?

Is model self-awareness based on genuine introspection or pattern matching?

Do models verbalize their implicit knowledge when that knowledge influences their output?

Do base models contain latent reasoning that training can unlock?

What makes a model fail to activate relevant skills from its own harness?

Why do language models fail to use knowledge they possess?

Inquiring lines that read this note 8

Related papers in this collection 8

Search by related questions 4