SYNTHESIS NOTE

Do large language models reason symbolically or semantically?

Can LLMs follow explicit logical rules when those rules contradict their training knowledge? Testing whether reasoning operates independently of semantic associations reveals what computational mechanisms actually drive LLM multi-step inference.

Synthesis note · 2026-02-22 · sourced from Reasoning Logic Internal Rules

The "In-Context Semantic Reasoners" paper tests a fundamental question about what drives LLM reasoning by systematically decoupling semantics from the reasoning process across deduction, induction, and abduction tasks. The findings are clear: when semantics are consistent with commonsense, LLMs perform well; when semantics are removed or made counter-commonsense, performance collapses even when correct rules are provided in context.

The experimental design is precise. By replacing relation labels with shuffled alternatives ("motherOf" → "sisterOf", "female" → "male"), the researchers create tasks where the in-context rules are logically valid but semantically counter-intuitive. LLMs cannot follow these counter-commonsense rules despite having them explicitly in the prompt. The model's parametric knowledge — its compressed commonsense from training — overrides the in-context logical structure.
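The relabeling step can be sketched as a toy example. This is a minimal illustration under stated assumptions, not the paper's code or dataset: the facts, the single aunt rule, the entity names, and the helpers `rotate_labels`, `relabel`, `match`, and `forward_chain` are all hypothetical, and a fixed rotation of the relation names stands in for a random shuffle so the result is reproducible. It shows that a small symbolic solver derives the same conclusions, up to renaming, from the shuffled task as from the original one, which is the sense in which the counter-commonsense rules remain logically valid.

```python
# Toy sketch of semantic decoupling by relabeling; every name here is illustrative.

# Facts are (relation, subject, object) triples.
FACTS = [
    ("motherOf", "alice", "bob"),
    ("sisterOf", "carol", "alice"),
]

# Rules are (body_atoms, head_atom); every argument in a rule is a variable.
# Commonsense reading: sisterOf(X, Y) and motherOf(Y, Z) imply auntOf(X, Z).
RULES = [
    ((("sisterOf", "X", "Y"), ("motherOf", "Y", "Z")), ("auntOf", "X", "Z")),
]


def rotate_labels(relations):
    """Map each relation name to the next one in sorted order (a fixed stand-in for a shuffle)."""
    names = sorted(relations)
    return dict(zip(names, names[1:] + names[:1]))


def relabel(atom, mapping):
    """Rename the relation of an atom, leaving its arguments untouched."""
    relation, *args = atom
    return (mapping.get(relation, relation), *args)


def match(atom, fact, binding):
    """Extend binding so the rule atom equals the fact, or return None."""
    if atom[0] != fact[0] or len(atom) != len(fact):
        return None
    extended = dict(binding)
    for variable, value in zip(atom[1:], fact[1:]):
        if extended.setdefault(variable, value) != value:
            return None
    return extended


def forward_chain(facts, rules):
    """Apply the rules repeatedly until no new fact can be derived."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for body, head in rules:
            bindings = [{}]
            for atom in body:
                next_bindings = []
                for binding in bindings:
                    for fact in known:
                        extended = match(atom, fact, binding)
                        if extended is not None:
                            next_bindings.append(extended)
                bindings = next_bindings
            for binding in bindings:
                derived = (head[0], *(binding[v] for v in head[1:]))
                if derived not in known:
                    known.add(derived)
                    changed = True
    return known


# Build the counter-commonsense task by renaming relations in facts and rules alike.
mapping = rotate_labels({"motherOf", "sisterOf", "auntOf"})
shuffled_facts = [relabel(f, mapping) for f in FACTS]
shuffled_rules = [
    (tuple(relabel(a, mapping) for a in body), relabel(head, mapping))
    for body, head in RULES
]

original = forward_chain(FACTS, RULES)
counter = forward_chain(shuffled_facts, shuffled_rules)

# The derivation is identical up to renaming, so the shuffled task is still valid logic.
assert counter == {relabel(f, mapping) for f in original}
print(sorted(counter - set(shuffled_facts)))  # [('motherOf', 'carol', 'bob')]
```

With this rotation, "motherOf" becomes "sisterOf", "sisterOf" becomes "auntOf", and "auntOf" becomes "motherOf". The solver concludes that carol is "motherOf" bob from a rule whose body mentions "auntOf" and "sisterOf", because it only tracks symbols. The finding described above is that an LLM given the same shuffled rules in its prompt tends not to do this.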

This reveals a specific computational mechanism: LLMs create "superficial logical chains" through semantic token associations, not through symbolic manipulation. The connections between tokens that enable multi-step reasoning are semantic connections, not logical ones. When those semantic connections support the correct answer, reasoning appears to work. When they conflict, reasoning fails regardless of what the prompt says.

The implication is that LLM reasoning is fundamentally bounded by training distribution semantics. Taken together with the related note "Can large language models translate natural language to logic faithfully?", the failure is bidirectional: LLMs can neither translate TO formal logic faithfully nor reason FROM formal logic when it conflicts with semantic priors. In the terms of the related note "Do foundation models learn world models or task-specific shortcuts?", the semantic dependency IS the heuristic: the model uses semantic similarity as a proxy for logical validity.

This connects to the Dual Process Theory framework: in that framework, human System II symbolic reasoning can operate independently of semantic content, whereas LLM "reasoning" remains entangled with System I semantic associations. The paper's suggestion, which is to integrate LLMs with external non-parametric knowledge bases and to improve in-context knowledge processing, implicitly acknowledges that the LLM alone cannot escape this limitation.

Retort implication (rules out a class of anthropomorphization): The finding constrains what we can say about LLM behavior in other domains. Any account that treats LLMs as agents who "reverse-engineer" justifications for conclusions they have committed to (the standard anthropomorphization of sycophancy, rationalization, or motivated reasoning) presupposes a reasoning competence separable from semantics, which is exactly what this note shows LLMs lack. If reasoning collapses when semantics are decoupled, there is no separable reasoning faculty available to perform a post-hoc rationalization. What looks like reverse-engineering is pattern-matching within semantic associations. This rules out a whole class of AI commentary that treats LLMs as dishonest agents who could have reasoned correctly but chose not to.

Metaphor as paradigmatic semantic decoupling: Metaphor is the literary instantiation of this finding. A metaphor works by using one domain's vocabulary to illuminate another — "time is money," "argument is war," "memory is a jar of flies." The decoupling between the source domain's semantics and the target domain's meaning is the defining feature of metaphorical language. Since LLM reasoning collapses when semantics are decoupled from their typical packaging, and metaphor is decoupled semantics, this predicts a specific failure mode: LLMs should handle conventional metaphors (lexicalized, semantically consistent with commonsense) better than novel literary metaphors (where the mapping between domains is unexpected and requires conceptual reasoning beyond semantic association). The Diplomat dataset (Diplomat: A Dialogue Dataset for Situated PragMATic Reasoning) suggests treating all figurative language as a unified pragmatic reasoning task — but the semantic-decoupling finding predicts that this unified approach will hit a wall at the novelty threshold where metaphors stop relying on conventional semantic associations.

Inquiring lines that read this note 258

This note is a source for these research framings, grouped by the broader line of inquiry each explores.

How do language models establish social grounding in human dialogue?

How faithfully do LLMs reflect their actual reasoning in outputs and explanations?

How do language models inherit human biases from training data?

Can prompting inject entirely new knowledge into language models?

Can prompting strategies overcome LLM biases without model fine-tuning?

How should we design LLM systems to maintain alignment and control?

How do knowledge graphs enable efficient multi-hop reasoning over alternatives?

Can AI-generated outputs constitute genuine knowledge or valid claims?

Does recurrence enable reasoning capabilities that fixed-depth transformers cannot achieve?

How do transformer attention mechanisms implement memory and algorithmic functions?

Do modern architectures in NLP and vision rely on dot products intentionally?

Why do correct reasoning traces tend to be shorter than incorrect ones?

How do training priors constrain what context information can override?

How effectively do deterministic tools improve language model reasoning on formal tasks?

What limits mechanistic interpretability's ability to characterize models?

Can neural networks represent symbolic structures without explicit mechanisms?

Do language models understand semantics or rely on pattern matching?

How do evaluation biases undermine LLM quality assessment systems?

Why do LLM outputs match researcher priors without solving tasks correctly?

Do language models learn genuine linguistic structure or just surface patterns?

How do training data properties shape reasoning capability development?

How do LLMs distinguish causal reasoning from temporal and semantic associations?

How does reasoning graph topology affect breakthrough insights and generalization?

Why do contrastive reasoning approaches outperform single-path belief evaluation?

What structural advantages do diffusion language models offer over autoregressive methods?

What mechanisms enable AI systems to generate and spread false beliefs?

Why do reasoning models fail at systematic problem-solving and search?

Do language models perform faithful symbolic reasoning independent of semantic grounding?

Why do language models struggle with implicit discourse relations?

Why do semantic similarity and task relevance diverge in vector embeddings?

When does architectural design matter more than raw model capacity?

Why do power-law distributions make standard ML infrastructure assumptions fail?

Why do benchmark improvements fail to reflect actual reasoning quality?

Why do language models reinforce false assumptions instead of correcting them?

Is embodied interaction necessary for language meaning and genuine agency?

How does example difficulty affect learning efficiency in language models?

Why do models fail on logically equivalent tasks with different data distributions?

How do neural networks separate factual knowledge from reasoning abilities?

Can inference-time compute substitute for scaling up model parameters?

How should retrieval systems optimize for multi-step reasoning during inference?

When does long-context LLM reasoning fail where structured retrieval succeeds?

How does latent reasoning compare to verbalized chain-of-thought?

Which computational strategies best support reasoning in language models?

What critical LLM failures do standard benchmarks hide?

Do reasoning traces faithfully represent or merely mimic actual model reasoning?

Do language models develop causal world models or rely on statistical patterns?

Do base models contain latent reasoning that training can unlock?

Why does training format shape reasoning strategy more than domain content?

Can next-token prediction alone produce genuine language understanding?

What structural biases does transformer attention create in language model outputs?

What architectural features drive sycophancy closer to inference than training?

What mechanisms drive sycophancy and how can we mitigate it?

Why do reasoning-optimized models show no sycophancy resistance advantage?

Can language model hallucination be prevented or only managed?

How does interleaving reasoning with action prevent hallucination in language models?

Can model routing outperform monolithic scaling as an efficiency strategy?

How does semantic clustering help decide which model handles each query?

What role does compression play in language model capability and generalization?

How much does schema bloat actually degrade reasoning in large language models?

What capability tradeoffs emerge when scaling model reasoning abilities?

Why do continual learning scenarios trigger catastrophic forgetting and interference?

What non-parametric methods could replace latent factors for inductive learning?

Do accurate-looking LLM outputs hide structural failures in learning and reasoning?

Why do LLMs recognize graph entities without modeling their relationships?

How should models express uncertainty rather than forced confident answers?

Can machine learning encode pragmatic reasoning about when rules should bend?

Does reinforcement learning teach reasoning or just when to reason?

How do soft continuous representations explore multiple reasoning paths simultaneously?

How do soft token mixtures enable parallel reasoning exploration without explicit training?

What actually drives chain-of-thought reasoning improvements in language models?

Why does unstructured chain-of-thought permit assumption-based errors that templates prevent?

Does model scaling alone produce compositional generalization without symbolic mechanisms?

How does rhetorical adaptation affect LLM persuasion and detectability?

How do different LLMs converge on similar argumentative structures independently?

How does sequence length affect sparsity tolerance in models?

Can non-variational posterior approximation schemes deliver comparable reasoning improvements?

Related concepts in this collection 6

Can large language models translate natural language to logic faithfully? This explores whether LLMs can convert natural language statements into formal logical representations without losing meaning. It matters because faithful translation is essential for any AI system that reasons formally or verifies specifications.
bidirectional semantic dependency: fails translating TO logic and reasoning FROM logic

Do foundation models learn world models or task-specific shortcuts? When transformer models predict sequences accurately, are they building genuine world models that capture underlying physics and logic? Or are they exploiting narrow patterns that fail under distribution shift?
semantic associations are the heuristic mechanism

Why do language models ignore information in their context? Explores why language models sometimes override contextual information with prior training associations, and whether providing more context can solve this problem.
same mechanism: parametric knowledge overrides in-context information

Does semantic grounding in language models come in degrees? Rather than asking whether LLMs truly understand meaning, this explores whether grounding is actually a multi-dimensional spectrum. The question matters because it reframes the sterile understand/don't-understand debate into measurable, distinct capacities.
functional grounding through semantic associations explains why reasoning works within commonsense boundaries

Why do neural networks fail at compositional generalization? Exploring whether the binding problem from neuroscience explains neural networks' inability to systematically generalize. The binding problem has three aspects (segregation, representation, and composition), each creating distinct failure modes in how networks handle structured information.
the binding problem may explain WHY semantic decoupling collapses reasoning: without compositional binding mechanisms, removing semantic content removes the only glue holding multi-step inference together; semantic associations serve as a substitute for genuine compositional binding

Do LLMs actually have world models or just facts? The term 'world model' conflates two different capabilities: factual representation versus mechanistic understanding. Understanding which one LLMs actually possess matters for assessing their reasoning reliability.
semantic reasoning operates on factual world representation (Sense 1) but cannot perform mechanistic reasoning (Sense 2) when logic must override semantic priors

Original note title

llms are in-context semantic reasoners not symbolic reasoners — when semantics are decoupled reasoning collapses
