INQUIRING LINE

Inquiring lines›How do language models construct a…›How does AI persuasion undermine h…›What limits mechanistic interpreta…›this inquiring line

What gives a neural network its instinct to carve raw input into distinct things before it's ever told what a 'thing' is?

What inductive biases help networks segregate entities from raw inputs?

This explores the architectural and training 'inductive biases' — the built-in tendencies — that let a network pull discrete structure (modules, concepts, entities) out of undifferentiated input, rather than treating everything as one undifferentiated blob.

This reads the question as being about what built-in tendencies push a network to carve structure out of raw, unlabeled input — to separate the parts rather than smear them together. A caveat worth stating up front: this collection is centered on language models, not on the object-centric vision work where 'entity segregation from pixels' is usually studied, so the most direct answer is oblique. But several notes converge on the deeper principle, which is that segregation is something networks *acquire as a bias*, not something you have to hand-wire.

The sharpest piece of evidence is that networks segregate structure on their own. Pruning experiments show neural networks spontaneously decompose compositional tasks into isolated subnetworks — ablate one and only its corresponding function breaks Do neural networks naturally learn modular compositional structure?. That's entity-segregation at the level of *function* rather than perception: the network keeps the parts separable. Crucially, the same note finds that pretraining makes this modular structure far more consistent — so the strongest inductive bias here isn't an architecture choice, it's prior exposure.

Architecture still tilts the odds, though. Depth specifically buys you compositional separation: deep-and-thin sub-billion models beat balanced ones because layers let the network *compose abstract concepts* step by step rather than spreading capacity sideways across width Does depth matter more than width for tiny language models?. Segregating entities is exactly this kind of hierarchical build-up — primitives at the bottom, composed objects higher up — and depth is the bias that supports it.

The representational side adds a twist you might not expect: a network's tendency to use *dense* vs. *sparse* codes is itself learned from data familiarity, not fixed by the architecture. Models develop dense activations for familiar inputs and fall back to sparse ones for unfamiliar inputs, with no task-specific fine-tuning Is representational sparsity learned or intrinsic to neural networks?. Since sparsity is one of the classic levers for forcing a network to allocate distinct units to distinct entities, this says the very property you'd lean on for segregation is itself a moving, experience-shaped target.

The honest synthesis: the corpus argues that the bias that matters most for pulling entities apart is *prior training* — pretraining sharpens modularity, familiarity reshapes sparsity — with depth as the architectural assist for hierarchical composition. What it does *not* contain is dedicated object-centric / perceptual-binding work (slot attention, scene decomposition from pixels), so if your real question is about vision-style segregation, this collection answers the principle but not that specific literature.

Sources 3 notes

Do neural networks naturally learn modular compositional structure?

Pruning experiments reveal that neural networks implement compositional subroutines in isolated subnetworks, with ablations affecting only their corresponding function. Pretraining substantially increases the consistency and reliability of this modular structure across architectures and domains.

Does depth matter more than width for tiny language models?

MobileLLM shows deep-and-thin architectures yield 2.7–4.3% accuracy gains over balanced designs at 125M–350M scale by composing abstract concepts through layers rather than spreading parameters across width.

Is representational sparsity learned or intrinsic to neural networks?

During pretraining, neural networks develop dense activations for familiar training data and default to sparse representations for unfamiliar inputs. This trend emerges without task-specific fine-tuning and reflects how models consolidate knowledge through exposure.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Hierarchical Reasoning Model1.68 match · arxiv ↗
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning1.64 match · arxiv ↗
Break It Down: Evidence for Structural Compositionality in Neural Networks0.95 match · arxiv ↗
Scaling can lead to compositional generalization0.92 match · arxiv ↗
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases0.88 match · arxiv ↗
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks0.87 match · arxiv ↗
Farther the Shift, Sparser the Representation: Analyzing OOD Mechanisms in LLMs0.87 match · arxiv ↗
Scaling Laws for Neural Language Models0.86 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst. The question: *What inductive biases help networks segregate entities from raw inputs?* — framed as still-open across vision and language domains.

What a curated library found — and when (dated claims, not current truth): Findings span 2023–2026, predominantly from language-model research:

• Networks spontaneously decompose compositional tasks into modular subnetworks; ablation shows isolated function loss. Pretraining dramatically sharpens modularity (2023).
• Depth (not width) is the architectural bias enabling hierarchical composition: deep-and-thin sub-billion models outperform balanced ones, supporting step-by-step entity build-up (2024).
• Representational sparsity—a classic lever for entity segregation—is itself learned from input familiarity; models toggle between dense (familiar) and sparse (OOD) codes with no task tuning (2026).
• High-entropy minority tokens drive RL effectiveness; positional biases in prompting show structure is learnable, not hard-coded (2025).
• Reasoning and causal models may segregate reasoning chains, but consistency training and sycophancy-suppression suggest segregation competes with other objectives (2025).

Anchor papers (verify; mind their dates):
• 2301.10884 (2023): Structural Compositionality in Neural Networks
• 2402.14905 (2024): MobileLLM depth-vs-width trade-offs
• 2406.11741 (2024): Transcendence (generative models)
• 2603.03415 (2026): OOD sparsity mechanisms

Your task:
(1) RE-TEST each constraint. Has modular decomposition held up in frontier models (o1, Gemini 2.0, etc.)? Does depth remain the dominant bias, or have architectural innovations (MoE, SSM, hybrid) shifted the trade-off? Does sparsity still toggle on OOD input, or has post-training stability eroded that finding? Separate the durable principle (hierarchical composition aids segregation) from perishable limits (depth dominance, sparse-code brittleness).
(2) Surface the strongest *contradicting or superseding* work from the last 6 months — especially multi-agent, world-model, or object-centric vision papers that may have re-unified segregation and perception, or revealed that segregation harms certain capabilities (alignment, generalization).
(3) Propose 2 research questions assuming the regime shifted: *How do reasoning-chain segregation and token-space segregation interact in chain-of-thought models?* and *Does deliberate entity-desegregation (e.g., via mixture or interference) improve robustness or few-shot transfer in OOD settings?*

Cite arXiv IDs; flag anything you cannot ground in a real paper.

What gives a neural network its instinct to carve raw input into distinct things before it's ever told what a 'thing' is?

Related lines of inquiry

Sources 3 notes

Papers this line draws on 8