SYNTHESIS NOTE

Can language models learn meaning without engaging the world?

Explores whether LLMs prove that meaning emerges from relational structure alone, independent of embodied experience or external reference. Tests structuralist theory empirically.

Synthesis note · 2026-04-18 · sourced from Linguistics, NLP, NLU

"Computational Structuralism: Toward a Formal Theory of Meaning in the Age of Digital Intelligence" (2026) proposes a synthesis of deep learning, information theory, and French structuralism to interpret LLM success. The core argument: LLMs demonstrate that transformations over relational structure are sufficient for generating culturally and situationally specific discourse, and that such structure can be inductively derived from discourse traces alone — phenomenal or embodied engagement with the world is not a necessary condition.

The framework retraces the lineage from Saussure (language as a system of differences, with meanings defined relationally) through Lévi-Strauss (structural analysis extended to culture broadly, with binary oppositions as a compression of complexity) to Bourdieu (habitus as transposable classification schemas operating in continuous social space). On this account, LLMs trained on web text learn not just grammar but the structure of culturally situated linguistic action: which voices make which statements in response to which situations, and how audiences respond.

Key theoretical moves:

LLMs operationalize Saussure's concept of langue: not the set of all valid statements, but the system that can interpret and generate all valid statements.
Language modeling is equivalent to text compression: both remove redundancy by replacing it with generative principles, and the statistical dependencies that inform prediction are the same ones that make up the compressed model.
The framework privileges sufficiency over necessity: it does not claim that LLMs draw on the same operations as humans, only that one way to achieve fluent natural language has now been formally demonstrated.
Mechanistic interpretability offers the possibility of reverse-engineering these latent structures, answering structuralist questions (for example, how are ideologies composed from simpler features?) with empirical methods.
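
The equivalence between language modeling and compression in the list above can be made concrete with a toy example. The sketch below is not from the paper; it assumes a character-level bigram model with add-one smoothing (the names train_bigram, prob, and code_length_bits are hypothetical) and computes the ideal code length of a text as the sum of -log2 P(next | previous). An arithmetic coder driven by the same probabilities would come within a few bits of that total. The model is fitted on the same text it encodes, so the cost of transmitting the model itself is ignored here.

```python
import math
from collections import Counter, defaultdict


def train_bigram(text):
    """Count character bigrams. Returns (counts, vocabulary)."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(text, text[1:]):
        counts[prev][nxt] += 1
    return counts, sorted(set(text))


def prob(counts, vocab, prev, nxt):
    """Add-one smoothed estimate of P(nxt | prev)."""
    total = sum(counts[prev].values())
    return (counts[prev][nxt] + 1) / (total + len(vocab))


def code_length_bits(text, counts, vocab):
    """Ideal code length in bits: sum of -log2 P(next | prev) over the text."""
    return sum(
        -math.log2(prob(counts, vocab, prev, nxt))
        for prev, nxt in zip(text, text[1:])
    )


def uniform_bits(text, vocab):
    """Baseline: every character costs log2(|vocab|) bits, no model at all."""
    return (len(text) - 1) * math.log2(len(vocab))


# Illustrative sample text (an assumption, chosen for its obvious redundancy).
sample = "the cat sat on the mat. the cat sat on the hat. " * 20
counts, vocab = train_bigram(sample)
model_bits = code_length_bits(sample, counts, vocab)
baseline_bits = uniform_bits(sample, vocab)

print(f"bigram model: {model_bits:.0f} bits")
print(f"uniform code: {baseline_bits:.0f} bits")
```

The better the model predicts the next character, the fewer bits the text costs. In this toy setting, the statistical dependencies used for prediction and the savings achieved in compression are the same quantity viewed from two sides.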

This challenges both sides of the grounding debate: it validates the structuralist intuition that relational form can carry meaning without referential content, while simultaneously showing that what LLMs learn is not "pure language" but socially and culturally situated discourse patterns. The concern raised in "Can language models learn meaning from text patterns alone?" (Bender & Koller) is not refuted but reframed: what is sufficient for generating meaningful discourse may not include what is necessary for understanding it.

Connects to "Does semantic grounding in language models come in degrees?": computational structuralism explains why functional grounding succeeds, namely that the relational structure of discourse is compressible and learnable. The open question is whether this constitutes meaning or merely its simulation.

Inquiring lines that read this note 122

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

Can language model hallucination be prevented or only managed?

Does conversational format create illusions of genuine AI communication?

Does tokenized intelligence retain genuine value through exchange-based systems?

Can relational value exist without a person behind the output?

Is embodied interaction necessary for language meaning and genuine agency?

Why do language models reinforce false assumptions instead of correcting them?

Do language models learn genuine linguistic structure or just surface patterns?

How do LLMs distinguish causal reasoning from temporal and semantic associations?

How do language models establish social grounding in human dialogue?

How do training priors constrain what context information can override?

What limits mechanistic interpretability's ability to characterize models?

Why do language models struggle with implicit discourse relations?

What articulatory information do speech signals carry that text cannot?

Is model self-awareness based on genuine introspection or pattern matching?

Do language models understand semantics or rely on pattern matching?

How should models express uncertainty rather than forced confident answers?

How does Peircean Secondness differ from what RLHF actually provides?

How do formal dialogue structures reveal conversation coherence mechanisms?

Do language models perform faithful symbolic reasoning independent of semantic grounding?

Can language models reason without relying on learned semantic patterns?

Can next-token prediction alone produce genuine language understanding?

Does next-token prediction alone produce genuine functional language competence?

Why do benchmark improvements fail to reflect actual reasoning quality?

Can correct model outputs prove that semantic meaning rather than surface patterns drove the response?

Do language models develop causal world models or rely on statistical patterns?

Why can't humans reliably detect AI-generated text despite measurable linguistic signatures?

How can structurally different text produce equivalent real-world effects?

Do language model representations contain causally steerable task-specific features?

Can LLM personas constitute genuine psychology or remain linguistic role-play?

What role does the biological substrate play in human relational identity?

What critical LLM failures do standard benchmarks hide?

Do distributed relational tasks consistently underperform local classification across NLP domains?

Why do reasoning models fail at systematic problem-solving and search?

Why does the Chinese Room argument miss the deeper abstraction problem?

Do autonomous architecture discoveries follow predictable scaling laws?

What are the scaling law differences between vision and language learning?

Does model scaling alone produce compositional generalization without symbolic mechanisms?

What factors beyond surface content determine how readers extract meaning differently?

Can readers detect meaning through resonance patterns alone without knowing authorial intent?

How does rhetorical adaptation affect LLM persuasion and detectability?

How do different LLMs converge on similar argumentative structures independently?

Does recurrence enable reasoning capabilities that fixed-depth transformers cannot achieve?

Can transformers abstract relational structure without explicit symbolic machinery?


Original note title

LLMs operationalize Saussure's langue: fully relational models with no external referents suffice to generate contextually appropriate discourse
