INQUIRING LINE

Model Architecture and Internals · Reasoning, Retrieval, and Evaluation · Psychology, Society, and Alignmentcross-cluster

How do world models decompose between representation of facts versus generative mechanisms?

This explores whether a world model is one thing or two — a store of facts about the world versus a generative engine that can run forward, simulate interventions, and answer 'what if' — and what the corpus says about how (and whether) those two parts come apart.

This question reads world models as having two separable jobs: holding facts about how things are, and running a mechanism that can generate what would happen next. The corpus suggests these don't just decompose — they're driven by different machinery and fail in different ways. The cleanest split comes from analysis of pretraining data: factual recall leans on narrow, document-specific memorization (the fact lives in particular documents and is retrieved), while reasoning and generalization ride on broad, transferable procedural knowledge spread across many sources Does procedural knowledge drive reasoning more than factual retrieval?. Facts are looked up; mechanisms are abstracted. That's the decomposition in its rawest form.

Where it gets interesting is that a model can ace the fact layer while having no real generative mechanism underneath. A model can hit high prediction accuracy using task-specific heuristics — surface regularities that look like understanding — without ever building a coherent generative model of how the world works. The test that separates them is whether the model can reason about interventions and counterfactuals, not just predict observations What makes a world model actually useful for reasoning?. So the two halves aren't merely distinct; the representational half can masquerade as the whole, which is exactly why you can't infer a working mechanism from accurate facts.

This maps onto a deeper methodological point: representation and mechanism need different tools to even study. Representational analysis finds what's encoded (correlations, features), but only causal analysis shows what actually drives behavior — and a complete mechanistic claim requires pairing the two Can we understand LLM mechanisms with only representational analysis?. The decomposition in world models mirrors the decomposition in how we investigate them. And there's a self-knowledge wrinkle: models carry a separate mechanism for tracking whether they *have* a fact at all — an entity-recognition circuit that steers between answering and refusing Do models know what they don't know?. The fact store, the generative engine, and the 'do I even know this' detector are three different things.

If the representational layer is largely looked-up memory, what is the generative mechanism actually made of? Two threads point the same direction: it's iterative, not static. World models scale through adaptive depth rather than parameter count — refining a latent environment state by looping computation, spending more steps on harder predictions, mirroring how physical systems unfold over time Can looped computation replace parameter count in world models?. The broader looped-architecture work makes the same case: recursion buys state-tracking and compositional generalization that raw scale can't Can models learn by looping instead of growing larger?. The mechanism is a process you run, while facts are content you store. And reasoning itself seems to live in those latent-state trajectories rather than in the surface text the model emits Where does LLM reasoning actually happen during generation?.

The twist worth leaving with: where do the facts come from if the model never touches the world? The corpus argues LLMs get indirect causal grounding — they extract structured regularities from text written by causally grounded humans, inheriting a world model secondhand through language Can large language models develop genuine world models without direct environmental contact?. Pushed further, one view holds the model only ever operates on relational structure compressed from text, with no external referent at all Can language models learn meaning without engaging the world?. That reframes the whole decomposition: the 'facts' are borrowed grounding and the 'mechanism' is a relational engine — which is precisely why the chain has gaps when it comes to verifying or updating in real time.

Sources 9 notes

Does procedural knowledge drive reasoning more than factual retrieval?

Analysis of 5 million pretraining documents shows reasoning relies on broad, transferable procedural knowledge from diverse sources, unlike factual recall which depends on narrow, document-specific memorization of target facts.

What makes a world model actually useful for reasoning?

Research shows LLMs may achieve high prediction accuracy through task-specific heuristics without developing coherent generative models of how the world works. True world models must enable reasoning about interventions and counterfactuals, not surface regularities.

Can we understand LLM mechanisms with only representational analysis?

Representational analysis alone identifies correlations without causation; causal analysis alone shows behavioral effects without explaining them. Only paired methods—locating candidate features representationally, then verifying causally—produce complete mechanistic claims.

Do models know what they don't know?

Sparse autoencoders revealed that language models develop causal mechanisms for detecting whether they know facts about entities. These mechanisms actively steer both hallucination and refusal behavior, and persist from base models into finetuned chat versions.

Can looped computation replace parameter count in world models?

LoopWM achieves up to 100x parameter efficiency by refining latent environment states through iterative computation in a shared block, with spectral-norm constraints providing formal stability guarantees. The approach mirrors physical system recurrence, spending more depth on harder prediction steps.

Can models learn by looping instead of growing larger?

Models that re-apply layers in recurrent depth outperform larger feedforward networks on reasoning tasks. This works because recursion enables state tracking and compositional generalization that parameter scaling alone cannot achieve, with convergence signals providing natural halting.

Where does LLM reasoning actually happen during generation?

Evidence from CoT faithfulness tests, feature steering, and layer analysis suggests latent-state dynamics drive reasoning, while surface chain-of-thought serves as a partial interface. Hidden reasoning processes should be the default focus of study.

Can large language models develop genuine world models without direct environmental contact?

LLMs form structured world representations by extracting regularities from training data produced by causally grounded humans. This constitutes indirect causal grounding mediated through text, though the chain has gaps that limit real-time verification and model updating.

Can language models learn meaning without engaging the world?

Research shows LLMs learn culturally situated discourse patterns by compressing relational structure from text, demonstrating that fluent language generation requires no external referents or embodied grounding.

How do world models decompose between representation of facts versus generative mechanisms?

Sources 9 notes

Next inquiring lines