INQUIRING LINE

Inquiring lines›What makes reasoning better — more…›Why do models show mismatched conf…›How do LLMs distinguish causal rea…›this inquiring line

Our best theories of cognition study each mental mechanism separately — but what if the interesting stuff only happens when they collide?

Are traditional cognitive theories missing interaction effects between mechanisms?

This explores whether models of cognition that study mechanisms one at a time (causality, reasoning, memory, knowledge retrieval) are blind to what happens when those mechanisms collide, reinforce, or interfere with each other.

This explores whether traditional cognitive theories — which tend to isolate one mechanism at a time — miss the effects that only appear when mechanisms interact. The corpus suggests this is a recurring blind spot, and several notes converge on it from very different directions.

The most direct evidence is that single-mechanism theories openly admit their own gaps. Causal belief networks model causal reasoning well but cannot represent associative links, analogical mappings, or emotion-driven belief shifts — the framework itself treats causality as a tractable starting point rather than a full account of how people reason Can causal models alone capture how humans actually reason?. The same one-lens-isn't-enough pattern shows up in how we study the machines: representational analysis alone finds correlations without causes, and causal analysis alone shows effects without explaining them — only pairing the two produces a complete mechanistic claim Can we understand LLM mechanisms with only representational analysis?. In both cases, the interesting behavior lives in the seam between methods, not inside any one of them.

Where the corpus gets sharper is on compounding — interaction effects that aren't just additive but multiplicative. Three cognitive traps in human-AI interaction (mistaking the map for the territory, conflating intuition with reasoning, and confirmation bias) don't simply stack; they multiply each other's distorting power when they co-occur, producing epistemic drift that none would cause alone Why do people trust AI outputs they shouldn't?. That's exactly the kind of effect a mechanism-by-mechanism theory would never see.

Interactions also turn out to be destructive, not just amplifying. Knowledge retrieval sits in lower network layers and reasoning adjustment in higher ones, so training that boosts reasoning can quietly degrade knowledge-heavy domains like medicine — an interference effect invisible if you study reasoning in isolation Why does reasoning training help math but hurt medical tasks?. Planning and execution interfere similarly: pulling the decomposer apart from the solver improves accuracy precisely because it stops the two from contaminating each other Does separating planning from execution improve reasoning accuracy?, and isolating reasoning operations into modular tools elicits capability that monolithic prompting can't reach Can modular cognitive tools unlock reasoning without training?. The lesson cuts both ways — sometimes the missing variable is a harmful interaction you have to engineer apart.

What you might not expect is that the same mechanism can flip sign depending on what it interacts with. Extended 'thinking' is counterproductive in a vanilla model — it breeds self-doubt — but RL training transforms that identical mechanism into productive gap analysis, meaning reasoning quality is mediated by training, not intrinsic to the mechanism Does extended thinking help or hurt model reasoning?. And memory-amortized inference reframes cognition itself as the interaction between memory reuse and inference rather than either alone Can cognition work by reusing memory instead of recomputing?. The throughline: across human and machine cognition, the corpus keeps finding that the action is in the interactions — and theories built around single, separable mechanisms are structurally positioned to miss it.

Sources 8 notes

Can causal models alone capture how humans actually reason?

Causal belief networks excel at modeling causal reasoning but cannot represent associative links, analogical mappings, or emotion-driven belief shifts. The GenMinds framework itself acknowledges this as a tractable starting point rather than a complete theory.

Can we understand LLM mechanisms with only representational analysis?

Representational analysis alone identifies correlations without causation; causal analysis alone shows behavioral effects without explaining them. Only paired methods—locating candidate features representationally, then verifying causally—produce complete mechanistic claims.

Why do people trust AI outputs they shouldn't?

Rose-Frame identifies map-territory confusion, intuition-reason conflation, and confirmation-bias reinforcement as traps that multiply their distorting effects when they co-occur. Evidence from cross-linguistic overreliance and architectural transformer biases confirms the compounding mechanism operates universally.

Why does reasoning training help math but hurt medical tasks?

Two-phase inference model shows knowledge retrieval operates in lower network layers while reasoning adjustment happens in higher layers. This separation explains why reasoning training improves math but can degrade knowledge-intensive domains like medicine.

Does separating planning from execution improve reasoning accuracy?

Modular architectures with separate decomposer and solver models outperform monolithic LLMs, with decomposition ability transferring across domains while solving ability does not. The separation prevents planning-execution interference and produces more generalizable skills.

Show all 8 sources

Can modular cognitive tools unlock reasoning without training?

Four cognitive tools implemented as sandboxed LLM calls improved GPT-4.1 on AIME2024 from 26.7% to 43.3% without any RL training. Modularity enforces operation isolation that pure prompting cannot guarantee, eliciting pre-existing reasoning capability.

Does extended thinking help or hurt model reasoning?

Vanilla models use thinking mode counterproductively, inducing self-doubt that degrades performance. RL training reverses this, transforming the same mechanism into beneficial gap analysis. Training mediates reasoning quality, not just quantity.

Can cognition work by reusing memory instead of recomputing?

Memory-Amortized Inference proposes intelligence arises from structured reuse of prior inference paths over topological memory, inverting RL's reward-forward logic into cause-backward reconstruction. This duality explains energy efficiency and suggests memory trajectories form the substrate of adaptive thought.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Eliciting Reasoning in Language Models with Cognitive Tools2.54 match · arxiv ↗
Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory1.72 match · arxiv ↗
Levels of Analysis for Large Language Models1.68 match · arxiv ↗
Base Models Know How to Reason, Thinking Models Learn When1.65 match · arxiv ↗
A Comment On "The Illusion of Thinking": Reframing the Reasoning Cliff as an Agentic Gap1.62 match · arxiv ↗
Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning1.60 match · arxiv ↗
LLM Reasoning Is Latent, Not the Chain of Thought1.58 match · arxiv ↗
Beyond Passive Critical Thinking: Fostering Proactive Questioning to Enhance Human-AI Collaboration0.90 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a cognitive science researcher tracking how LLM capability advances may have altered or dissolved constraints on single-mechanism theories. The question remains: Do traditional cognitive theories (human and machine) systematically miss interaction effects—cases where two mechanisms together produce something neither produces alone, or flip each other's sign?

What a curated library found — and when (dated claims, not current truth):
• Single-mechanism frameworks openly admit gaps: causal reasoning alone cannot represent emotion-driven belief shifts or analogical mapping (framed as intrinsic limits, not training gaps) (~2023–2024).
• Interaction effects can be destructive/interfering: knowledge retrieval in lower layers and reasoning in higher layers can degrade each other during training; modular decomposer-solver separation improves accuracy by preventing cross-contamination (~2025).
• The same mechanism flips sign based on interaction context: 'thinking' mode breeds self-doubt in vanilla models but becomes productive gap analysis under RL training (~2025).
• Memory-amortized inference reframes cognition itself as inherently relational (memory × inference) rather than separable (~2025).
• Three human cognitive traps multiply, not add, when co-occurring with AI systems (~2024–2025).

Anchor papers (verify; mind their dates):
• arXiv:2507.18178 (2025-07): Decoupling Knowledge and Reasoning—foundational for layer-wise interference claim.
• arXiv:2506.12115 (2025-06): Cognitive Tools—modular reasoning operations and capability elicitation.
• arXiv:2510.01265 (2025-09): RLP—RL's role in transforming mechanism function.
• arXiv:2508.14143 (2025-08): Memory-Amortized Inference—cognition as relational, not atomistic.

Your task:
(1) RE-TEST EACH CONSTRAINT. For each destructive/transformative interaction claim above, determine whether post-2025 scaling, architectural change (mixture-of-experts, retrieval-augmented generation, longer context windows), or training innovations (continued-pretraining on reasoning, synthetic data for domain-specific knowledge) have either relaxed the trade-off or revealed it to be an artifact of specific training regimes. Separately: has the single-mechanism limitation in human cognitive theory been addressed by neuroscience or behavioral work integration since these papers' publication? State plainly what still holds.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last 6 months: papers arguing that interaction effects are over-stated, or that single mechanisms *do* suffice under the right abstraction, or that interaction claims rest on weak causal inference.
(3) Propose 2 research questions assuming the mechanistic regime *has* moved—e.g., "Can we design training procedures that exploit interaction effects rather than engineer them apart?" or "Do emergent multi-agent systems exhibit interaction patterns that single-agent cognitive theories cannot predict?"

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Our best theories of cognition study each mental mechanism separately — but what if the interesting stuff only happens when they collide?

Related lines of inquiry

Sources 8 notes

Papers this line draws on 8