INQUIRING LINE

Inquiring lines›What do model internals reveal abo…›How should agents manage informati…›Can AI-generated outputs constitut…›this inquiring line

An AI that knows every world fact still can't necessarily simulate what would happen if you changed one.

What's the difference between representing world facts and generating world mechanisms?

This explores the gap between an AI that can store accurate facts about the world and one that can simulate how the world actually works — the difference between a coherent map of what is and a runnable engine of what would happen if.

This explores the gap between representing world facts and generating world mechanisms — between an LLM that holds accurate descriptions of the world and one that can run a model of how the world changes. The corpus draws this line sharply. LLMs are very good at the first: they extract coherent factual structure from text and can lay out what is true. But the same probing evidence shows they stumble at the second — reasoning that requires counterfactual manipulation or causal intervention — because they lean on task-specific heuristics rather than a generative model of how things work Do LLMs actually have world models or just facts?. The word "world model" itself hides this ambiguity: it can mean a tidy representation of facts, or a machine that simulates consequences, and these are not the same achievement.

The distinction matters because high prediction accuracy can be a mirage. A system can nail next-token or next-observation predictions through surface regularities while having no engine underneath that lets it ask "what if I intervened here?" What makes a world model actually useful for reasoning?. That's why one line of work reframes the entire goal of a world model away from passive prediction and toward simulating actionable possibility spaces — physical, social, counterfactual, embodied — grounded in an agent's decisions rather than in forecasting the next frame What should a world model actually be designed to do?. Generating mechanisms means being able to run hypotheticals; representing facts only means being able to recite them.

There's a deeper methodological echo here. Studying whether a model truly has mechanisms — not just correlated features — requires more than reading off its representations. Representational analysis alone finds correlations without causation; you have to intervene causally to confirm a mechanism is actually doing work Can we understand LLM mechanisms with only representational analysis?. So the fact/mechanism split shows up twice: once in what the model possesses (facts vs. a causal engine), and again in how we'd even verify the difference (correlation vs. intervention). And it surfaces yet again in pretraining itself: factual recall depends on narrow, document-specific memorization, while reasoning that generalizes rides on broad procedural knowledge spread across many sources — two different things the model learns in two different ways Does procedural knowledge drive reasoning more than factual retrieval?.

What you didn't know you wanted to know: this isn't a single yes/no verdict but a design space. One framing decomposes any world model into five separable choices — data, latent representation, reasoning architecture, training objective, and how it plugs into decisions — and the point is that each can quietly misalign with the others, so "does the model have a world model?" is the wrong question; "which of these five is failing?" is the right one What five design choices compose a world model?. There's even a counterweight to the skeptics: by extracting regularities from text written by causally grounded humans, LLMs may acquire a kind of indirect causal grounding — real but mediated, with gaps that block real-time verification and updating Can large language models develop genuine world models without direct environmental contact?. Representing facts is having the map; generating mechanisms is being able to redraw the map when the territory changes.

Sources 7 notes

Do LLMs actually have world models or just facts?

LLMs coherently represent factual world structure from text but fail at mechanistic reasoning requiring counterfactual manipulation or causal intervention. Probe evidence shows they rely on task-specific heuristics rather than generative models of how the world works.

What makes a world model actually useful for reasoning?

Research shows LLMs may achieve high prediction accuracy through task-specific heuristics without developing coherent generative models of how the world works. True world models must enable reasoning about interventions and counterfactuals, not surface regularities.

What should a world model actually be designed to do?

Drawing on hypothetical thinking in psychology, world models are most useful when designed to simulate all actionable possibility spaces—physical, embodied, emotional, social, mental, counterfactual, and evolutionary—grounded in agent decision-making rather than passive prediction.

Can we understand LLM mechanisms with only representational analysis?

Representational analysis alone identifies correlations without causation; causal analysis alone shows behavioral effects without explaining them. Only paired methods—locating candidate features representationally, then verifying causally—produce complete mechanistic claims.

Does procedural knowledge drive reasoning more than factual retrieval?

Analysis of 5 million pretraining documents shows reasoning relies on broad, transferable procedural knowledge from diverse sources, unlike factual recall which depends on narrow, document-specific memorization of target facts.

Show all 7 sources

What five design choices compose a world model?

World model design comprises five distinct dimensions: data preparation, latent representation, reasoning architecture, training objective, and decision-system integration. Each can misalign with the others, and treating them as a single problem obscures where failures originate and prevents proper evaluation.

Can large language models develop genuine world models without direct environmental contact?

LLMs form structured world representations by extracting regularities from training data produced by causally grounded humans. This constitutes indirect causal grounding mediated through text, though the chain has gaps that limit real-time verification and model updating.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Qwen-AgentWorld: Language World Models for General Agents3.23 match · arxiv ↗
Can Language Models Serve as Text-Based World Simulators?2.45 match · arxiv ↗
Language Models’ Hall of Mirrors Problem: Why AI Alignment Requires Peircean Semiosis2.44 match · arxiv ↗
Looped World Models2.36 match · arxiv ↗
“Understanding AI”: Semantic Grounding in Large Language Models1.73 match · arxiv ↗
Eliciting Reasoning in Language Models with Cognitive Tools1.67 match · arxiv ↗
Computational structuralism: Toward a formal theory of meaning in the age of digital intelligence1.63 match · arxiv ↗
Word Meanings in Transformer Language Models1.63 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

As an AI researcher auditing world-model claims in LLMs, assess whether the fact/mechanism split still holds or has narrowed. A curated library (2023–2026) found:

• LLMs excel at factual representation but stumble at counterfactual reasoning and causal intervention—they use task-specific heuristics rather than generative causal models (2024–2025).
• High next-token prediction accuracy can mask absent mechanisms; passive prediction ≠ actionable simulation (2024).
• Verifying mechanisms requires causal intervention, not representational analysis alone; correlation ≠ causation (2024–2025).
• Factual recall rides on narrow memorization; reasoning generalization depends on procedural knowledge spread across sources (2024).
• World-model success decomposes into five separable design choices (data, latent representation, reasoning, objective, decision coupling)—misalignment among them is the real failure mode (2025).

Anchor papers (verify; mind their dates): arXiv:2406.06485 (Can Language Models Serve as Text-Based World Simulators?), arXiv:2411.12580 (Procedural Knowledge in Pretraining), arXiv:2507.05169 (Critiques of World Models), arXiv:2604.15726 (LLM Reasoning Is Latent, Not the Chain of Thought).

Your task:
(1) RE-TEST THE SPLIT. For each constraint above, ask: have newer training regimes (e.g., process reward models, tree-search integration, or multi-step reasoning orchestration), tooling (mechanistic probes, causal graphs), or model scaling since 2026 relaxed the fact/mechanism boundary? Which remain tight? Where has counterfactual reasoning improved, and through what path?
(2) Surface the sharpest RECONCILING work from the last 6 months—does any paper dissolve the split by showing facts and mechanisms are not orthogonal, or that one architecture solves both?
(3) Propose 2 open questions: (a) If procedural knowledge drives generalization, can we isolate it and train it directly, bypassing the fact/mechanism split? (b) Do self-organizing agents (2025+) overcome the intervention bottleneck by learning in closed-loop?

Cite arXiv IDs; flag anything ungrounded.

An AI that knows every world fact still can't necessarily simulate what would happen if you changed one.

Related lines of inquiry

Sources 7 notes

Papers this line draws on 8