INQUIRING LINE

Inquiring lines›What do model internals reveal abo…›How do surface signals and framing…›What factors beyond surface conten…›this inquiring line

When AI learns word relationships from raw text, the hierarchy that emerges looks mathematically identical to one you'd deliberately engineer.

What spectral signatures distinguish hierarchy-driven geometry from corpus-driven geometry?

This explores whether you can tell, from the spectral fingerprint of an embedding space, that hierarchy was 'built in' versus that it fell out of raw text statistics — and the corpus's surprising answer is that there's no distinguishing signature, because the two are the same thing.

This explores whether you can look at the spectral fingerprint of an embedding space and tell apart geometry that comes from a built-in hierarchy mechanism versus geometry that simply precipitates out of word-counting. The sharp finding across the corpus is that the distinction you're reaching for mostly collapses: the spectral signature that *looks* hierarchy-driven turns out to be corpus-driven all the way down.

The clearest tell is the coarse-to-fine eigenvector order. When you take the Gram matrix of an embedding space and look at its leading eigenvectors, they split the vocabulary along broad taxonomic branches first, then progressively finer sub-branches, tracking the WordNet hypernym tree level by level Do embedding eigenvectors organize taxonomy from coarse to fine?. That layered spectral order is exactly what you'd predict from co-occurrence statistics alone — no hierarchy-specific machinery required. So the signature isn't evidence of a dedicated hierarchy mechanism; it's evidence of how often words show up near each other Where does hierarchical structure in language models come from?.

The knockout argument comes from comparing models that should have nothing in common. Word2vec embeddings and Gemma 2B unembeddings — trained with entirely different objectives — carry *identical* coarse-to-fine spectral signatures across WordNet taxonomies Do language models use the hierarchical geometry they inherit?. If the geometry were driven by some functional need for hierarchy, two such different systems wouldn't converge on the same eigenstructure. The shared fingerprint can only come from the one thing they share: the statistics of training text. In other words, when you spectrally decompose either model, you're reading the corpus, not the architecture.

What does a genuinely *different* geometric channel look like, for contrast? The Polar Probe work shows that syntactic relations live in a separate, angular code — type and direction of grammatical relations are encoded through both distance and angular position, not the radial coarse-to-fine nesting that taxonomy uses How do language models encode syntactic relations geometrically?. That's the useful lateral move: the corpus has more than one kind of structured geometry, and they have distinguishable signatures (radial-nested for taxonomy, polar-angular for syntax) — but the meaningful axis of difference is *taxonomy vs. syntax*, not *hierarchy-mechanism vs. corpus*.

Worth knowing the caveat: a clean spectral signature doesn't guarantee the model actually *uses* the structure well. Models can carry all the linearly decodable features for a task while their internal organization is fractured and fragile Can models be smart without organized internal structure?, and grammatical competence degrades predictably as structural depth and recursion increase Does LLM grammatical performance decline with structural complexity?. So the spectrum tells you the geometry inherited from the corpus is *there* — it doesn't tell you the model reliably reasons over it.

Sources 6 notes

Do embedding eigenvectors organize taxonomy from coarse to fine?

Leading eigenvectors of embedding Gram matrices separate broad taxonomic branches first, then progressively finer sub-branches—a coarse-to-fine spectral order that tracks the WordNet hypernym tree level by level, confirming predictions from co-occurrence statistics.

Where does hierarchical structure in language models come from?

LLM hierarchical representations arise as a direct mathematical consequence of corpus statistics, not from hierarchy-specific mechanisms. Spectral analysis of word co-occurrence matrices predicts and reproduces the same nested geometry found in trained embeddings and word2vec models.

Do language models use the hierarchical geometry they inherit?

Word2vec embeddings and Gemma 2B unembeddings share identical coarse-to-fine spectral signatures across WordNet taxonomies. Since these models have entirely different objectives, the shared structure must originate from training text statistics rather than convergent functional needs.

How do language models encode syntactic relations geometrically?

The Polar Probe shows LLMs represent syntactic type and direction through both distance and angular position between embeddings, nearly doubling accuracy over distance-only methods. This demonstrates neural networks spontaneously learn structured, symbolic-compatible geometry.

Can models be smart without organized internal structure?

Models trained with SGD can contain all the linearly decodable features needed for a task while maintaining fundamentally broken internal organization. This makes them vulnerable to perturbation and distribution shift invisible to standard evaluation metrics.

Show all 6 sources

Does LLM grammatical performance decline with structural complexity?

LLMs show systematic performance decline as syntactic depth and embedding increase. Simple sentences are handled well while complex structures with recursion and embedding fail consistently, suggesting LLMs learned surface heuristics rather than structural grammar rules.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Hierarchical Concept Geometry in Language Models Emerges from Word Co-occurrence3.54 match · arxiv ↗
Semantic Structure in Large Language Model Embeddings3.24 match · arxiv ↗
Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control3.22 match · arxiv ↗
Break It Down: Evidence for Structural Compositionality in Neural Networks2.41 match · arxiv ↗
A polar coordinate system represents syntax in large language models1.70 match · arxiv ↗
Large Linguistic Models: Investigating LLMs' metalinguistic abilities1.66 match · arxiv ↗
Computational structuralism: Toward a formal theory of meaning in the age of digital intelligence1.64 match · arxiv ↗
From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning1.59 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a mechanistic interpretability researcher investigating whether embedding spectral signatures reliably distinguish hierarchy-specific architectural mechanisms from corpus-driven geometry. The question remains open: can you use spectral decomposition to tell apart *designed* hierarchy from *emergent* hierarchy?

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026; treat these as perishable constraints:
• Coarse-to-fine eigenvector ordering mirrors WordNet taxonomy levels, but arises entirely from co-occurrence statistics—no dedicated hierarchy mechanism needed (~2026).
• Word2Vec and Gemma 2B unembeddings share identical hierarchical spectral signatures despite wholly different training objectives, suggesting corpus statistics, not architecture, drive the eigenstructure (~2026).
• Syntactic relations occupy a distinct polar-angular code (type and direction encoded via angle and distance), separating taxonomy geometry (radial-nested) from grammar geometry (angular), but both originate in corpus distribution (~2024-12).
• Spectral decodability does not guarantee functional use: models carry taxonomic structure yet show predictable degradation on deep recursion and symbolic tasks (~2025-03, ~2507.10624).
• Identical performance metrics can mask fragmented internal representations, decoupling geometric presence from reliable reasoning (~2025).

Anchor papers (verify; mind their dates):
• arXiv:2605.23821 (2026-05): Hierarchical Concept Geometry emerges from co-occurrence.
• arXiv:2412.05571 (2024-12): Polar coordinate system for syntax.
• arXiv:2507.10624 (2025-07): Architectural limits in symbolic tasks.
• arXiv:2503.19260 (2025-03): Linguistic blind spots and structural degradation.

Your task:
(1) RE-TEST THE COLLAPSE CLAIM. For each finding, judge whether recent model scaling, training data curation, attention mechanisms, or emergent specialization have *widened* the spectral gap between hierarchy-driven and corpus-driven geometry. Can you now distinguish them? Separate the durable question (what does spectral structure reveal about reasoning?) from the perishable constraint (corpus explains all variance). Cite concretely what relaxed it.
(2) Surface the strongest DISAGREEMENT: are there papers arguing that some models *do* implement dedicated hierarchy mechanisms with distinct spectral signatures? Or that spectral signatures *don't* predict reasoning competence at all?
(3) Propose 2 research questions that assume the regime has shifted: (a) If spectral hierarchy is now corpus-universal across model families, does *angular alignment* with external taxonomies (e.g., human-rated hierarchies) predict task performance? (b) Do multimodal or reinforcement-learning-trained models show spectral signatures that diverge from text-corpus geometry, suggesting non-corpus-driven hierarchy?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

When AI learns word relationships from raw text, the hierarchy that emerges looks mathematically identical to one you'd deliberately engineer.

Related lines of inquiry

Sources 6 notes

Papers this line draws on 8