INQUIRING LINE

Are static embeddings analogous to the formal linguistic competence layer?

This explores whether the static word-embedding layer of a transformer — what each token 'means' before attention mixes anything together — maps onto the 'formal linguistic competence' idea: a model's grasp of language structure as separate from using language to reason about the world.


This reads the question through the split that linguists draw between *formal* competence (knowing the rules and meanings of a language) and *functional* competence (using language to think and act in the world) — and asks where static embeddings sit. The corpus suggests static embeddings are real linguistic knowledge, but they're closer to the *lexicon* than to the formal-grammar layer the question names.

The strongest evidence that embeddings carry genuine knowledge comes from clustering analysis of RoBERTa's static vectors, which turn out to be sensitive to psycholinguistic properties like valence, concreteness, and even taboo — meaning each word arrives at the model already loaded with semantic content before self-attention does anything Do transformer static embeddings actually encode semantic meaning?. So embeddings aren't empty slots waiting to be filled by context; they function as standalone lexical entries. That's a competence layer in the sense that the knowledge is *in there* — but it's word-level meaning, not the structural machinery of grammar.

Where the analogy strains is structure. If static embeddings were the formal competence layer, you'd expect models to handle syntax cleanly. They don't: top models systematically misread embedded clauses, complex verb phrases, and nested nominals, and the errors get predictably worse as syntactic depth increases Why do large language models fail at complex linguistic tasks?. That points to the real formal-competence work happening in *composition* — what attention does across layers — not in the embedding lookup. The embeddings supply the pieces; the failures show up when the pieces have to be assembled by grammatical rules the model only approximates statistically.

There's a deeper framing here worth surfacing. One line of work argues LLMs operationalize Saussure's *langue* — they learn meaning purely as relational structure compressed from text, with no external referents Can language models learn meaning without engaging the world?. Under that view the whole model, embeddings included, is a formal-competence engine: it masters the internal relations of language without grounding. Static embeddings would then be the most relational layer of all — meaning defined entirely by neighbors. And efforts to move reasoning *up* to sentence-level embeddings, as in Large Concept Models, are essentially a bet that the formal/relational layer can carry abstraction on its own, language-agnostically Can reasoning happen at the sentence level instead of tokens?.

The thing you might not have expected to learn: the formal-competence analogy holds best precisely where models look most impressive and breaks where they look smartest. Embeddings nail word meaning and relational structure (formal), but the same architecture defaults to surface heuristics the moment a task needs genuine world- or mind-modeling — theory-of-mind benchmarks show models faking perspective-taking rather than tracking beliefs Do large language models genuinely simulate mental states?. So static embeddings aren't the whole formal layer — they're its lexical floor, and the gap they expose is exactly the formal-vs-functional boundary itself.


Sources 5 notes

Do transformer static embeddings actually encode semantic meaning?

Clustering analysis of RoBERTa embeddings reveals sensitivity to five psycholinguistic measures including valence, concreteness, iconicity, and taboo. This demonstrates that static embeddings function as genuine lexical entries containing semantic content before self-attention operates.

Why do large language models fail at complex linguistic tasks?

Top-tier LLMs like Llama3-70b consistently misidentify embedded clauses, verb phrases, and complex nominals. Performance degrades predictably as syntactic depth increases, revealing that statistical learning captures surface patterns but not deep grammatical rules.

Can language models learn meaning without engaging the world?

Research shows LLMs learn culturally situated discourse patterns by compressing relational structure from text, demonstrating that fluent language generation requires no external referents or embodied grounding.

Can reasoning happen at the sentence level instead of tokens?

Meta's Large Concept Model operates on sentence embeddings rather than tokens, reasoning in a language-agnostic space before decoding to any target language. This hierarchical approach with paragraph-level planning produces more coherent output than flat token generation.

Do large language models genuinely simulate mental states?

ChangeMyView and FANTOM benchmarks show LLMs fail at authentic perspective-taking in open-ended scenarios, despite succeeding on structured tasks. Hybrid Bayesian architectures that force explicit belief tracking outperform LLM-alone approaches, suggesting the gap is architectural rather than merely training-based.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a linguistic researcher evaluating whether static embeddings in LLMs map onto formal linguistic competence—a claim that a curated arXiv library (2024–2025) treated as partly true but structurally fragile. The question remains open: what *exactly* is the competence layer, and has that shifted?

What a curated library found—and when (dated claims, not current truth):
• Static embeddings encode lexical semantics (valence, concreteness, taboo sensitivity) *before* self-attention operates (~2024–2025)—they function as real word-level competence, not empty slots.
• Models systematically fail on embedded clauses, complex verb phrases, and nested structures with errors that worsen predictably at depth (~2025)—suggesting formal-competence work happens in *composition* (cross-layer attention), not lookup.
• LLMs default to surface-level strategies in theory-of-mind tasks rather than genuine belief-tracking (~2025)—exposing the formal/functional boundary within the same architecture.
• One framing treats LLMs as operationalizing Saussure's *langue*: meaning as purely relational structure with no external grounding (~2024–2025).
• Large Concept Models bet that sentence-level (not word-level) embeddings can carry language-agnostic abstraction (~2024–2025).

Anchor papers (verify; mind their dates):
• arXiv:2503.19260 (2025-03) *Linguistic Blind Spots of Large Language Models*
• arXiv:2502.08796 (2025-02) *A Systematic Review on the Evaluation of Large Language Models in Theory of Mind Tasks*
• arXiv:2508.12863 (2025-08) *Word Meanings in Transformer Language Models*
• arXiv:2507.13966 (2025-07) *Bottom-up Domain-specific Superintelligence*

Your task:
(1) RE-TEST EACH CONSTRAINT. For syntactic depth failures: have self-supervised pretraining, retrieval-augmented decoding (2024-06 long-context models), or new reasoning pathways (continuous latent space reasoning, 2024-12) since loosened those bottlenecks? For theory-of-mind surface heuristics (2025-02): do reinforcement-learning fine-tuning paths (2025-07) or mechanistic-interpretability indicators (2025-07) now reveal genuine perspective-tracking under the surface? Separate the durable question—what is the actual *composition* layer doing?—from the perishable limitation (perhaps newer architectures or training regimes have shifted it).
(2) Surface work from the last 6 months that contradicts the library's framing. Does arXiv:2507.13966 (knowledge graphs) or arXiv:2508.10736 (in-place prompting in diffusion LLMs) suggest embeddings are *less* central to competence than the library claims?
(3) Propose 2 research questions that assume the regime has moved: (a) If sentence-level (not word-level) embeddings now carry the formal layer, what structural properties distinguish them from static word embeddings? (b) If LLMs have learned to separate surface heuristics from genuine reasoning via self-improvement (2025-01 self-adaptive models; 2025-07 self-feedback RL), does that collapse the formal/functional boundary?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Next inquiring lines