Can language models learn meaning without engaging the world?
Explores whether LLMs prove that meaning emerges from relational structure alone, independent of embodied experience or external reference. Tests structuralist theory empirically.
"Computational Structuralism: Toward a Formal Theory of Meaning in the Age of Digital Intelligence" (2026) proposes a synthesis of deep learning, information theory, and French structuralism to interpret LLM success. The core argument: LLMs demonstrate that transformations over relational structure are sufficient for generating culturally and situationally specific discourse, and that such structure can be inductively derived from discourse traces alone — phenomenal or embodied engagement with the world is not a necessary condition.
The framework retraces the lineage from Saussure (language as a system of differences, meanings defined relationally) through Levi-Strauss (extending structural analysis to culture broadly, binary oppositions as compression of complexity) to Bourdieu (habitus as transposable classification schemas operating in continuous social space). LLMs trained on web text learn not just grammar but the structure of culturally situated linguistic action — which voices make which statements in response to which situations, and how audiences respond.
Key theoretical moves:
- LLMs operationalize Saussure's concept of langue — not the set of all valid statements, but the system that can interpret and generate all valid statements
- Language modeling is equivalent to text compression: removing redundancies by replacing them with generative principles. The same statistical dependencies that inform prediction compose the compressed model
- The framework privileges sufficiency over necessity — LLMs drawing on the same operations as humans is not claimed, but one way to achieve fluent natural language is now formally demonstrated
- Mechanistic interpretability offers the possibility of reverse-engineering these latent structures, answering structuralist questions (how are ideologies composed from simpler features?) with empirical methods
This challenges both sides of the grounding debate: it validates the structuralist intuition that relational form can carry meaning without referential content, while simultaneously showing that what LLMs learn is not "pure language" but socially and culturally situated discourse patterns. The concern from Can language models learn meaning from text patterns alone? (Bender & Koller) is not refuted but reframed — what counts as "sufficient" for meaning generation may not require what's necessary for meaning understanding.
Connects to Does semantic grounding in language models come in degrees? — computational structuralism explains why functional grounding succeeds: the relational structure of discourse is compressible and learnable. The question is whether this constitutes meaning or merely its simulation.
Inquiring lines that use this note as a source 118
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can fixing hallucination address AI's structural epistemic problem?
- Can secondary orality exist without any embodied human participant at all?
- Can we develop competent reading practices for disembodied orality?
- Can relational value exist without a person behind the output?
- Can knowledge flow without an embodied carrier transmitting it?
- How does training data preserve communicative event structure without the actual events?
- Do language models raise validity claims in the Habermasian sense?
- Can you separate grammatical competence from rhetorical commitment in language systems?
- Can a relational entity bear psychological properties the way Chalmers claims?
- Can LLMs infer situational context the way humans do pragmatically?
- Why does frame-activation matter more than word-by-word composition?
- How does enactive theory define language differently than computational linguistics?
- Can linguistic agency exist without embodiment and real-world participation?
- Why does training data saliency distort how models judge meaning?
- How do humans learn language through communication differently than LLM text prediction?
- How do low-dimensional representation structures entangle multiple cultures together?
- Can mechanistic interpretability reveal how ideologies decompose into simpler features?
- How does mechanistic interpretability reveal ideological structures in language model weights?
- What makes relational structure sufficient for generating contextually appropriate discourse?
- Does functional grounding through discourse patterns count as genuine semantic meaning?
- What makes internal embeddings useful as multimodal input for language model training?
- Why do users attribute consciousness to language models in practice?
- How does syntactic encoding relate to semantic feature representation?
- How does Peircean Secondness differ from what RLHF actually provides?
- Can statistical learning from language alone capture all aspects of cultural competence?
- Do language models learn surface patterns instead of underlying linguistic principles?
- Can implicit linguistic information ever be reliably learned from training data?
- What makes a relational act different from just moving content around?
- What makes linguistic agency impossible for systems without embodiment?
- How does semantic grounding differ between human minds and language models?
- Can language models reason without relying on learned semantic patterns?
- How does monological training on text differ from dialogical training in conversation?
- Can large language models understand language without embodied grounding systems?
- Does next-token prediction alone produce genuine functional language competence?
- Can language about model behavior ever be accurate without anthropomorphic framing?
- Can LLMs improve at metaphor if they handle decoupled semantics better?
- How does implicit meaning processing limit LLM pragmatic reasoning?
- Can language models acquire meaning from distributional patterns alone without joint attention?
- Why do language models fail at implicit discourse relations while handling explicit connectives?
- Does embodiment and interaction matter for linguistic competence beyond pattern learning?
- What architectural changes would let language models develop genuine functional competence?
- Can LLMs participate meaningfully in discourse without consciousness or understanding?
- Can correct model outputs prove that semantic meaning rather than surface patterns drove the response?
- Can frame semantics explain why context matters more than word similarity?
- Does selective suppression of linguistic relations enable human meaning-making?
- Why do explicit discourse connectives help LLMs but implicit relations cause failures?
- What structural signals in user language reveal their unstated preferences and context?
- What role does language play as a cognitive scaffold versus communication tool?
- Does embodiment matter for genuine linguistic agency?
- How does the symbol grounding problem apply to artificial language systems?
- What role does joint attention play in how humans learn language meaning?
- Can language meaning emerge without joint attention and shared embodied interaction?
- Can LLMs infer implicit meaning without surface linguistic markers?
- What distinguishes surface cues from structural meaning in language understanding?
- What structural limits prevent LLMs from abstracting moral principles?
- How do world models create indirect causal grounding without physical environment contact?
- How does embodiment affect whether LLMs can participate in Wittgensteinian language games?
- Can LLMs predict social norms without deep integration into linguistic practices?
- How can structurally different text produce equivalent real-world effects?
- Can language models develop world models that ground meaning in causal reality?
- Can understanding language happen entirely within a language system alone?
- How do internal representations compare to human cognitive structures?
- Do language models actually learn linguistic structure or just surface statistics?
- Can LLMs develop genuine understanding without embodied experience?
- What structural properties of language models make fabrication inevitable?
- Do metaphors work by decoupling meaning from linguistic associations?
- Can LLMs identify implicit metaphoric mappings that require pragmatic inference?
- Can LLM semantic representations exist without causally influencing their generation output?
- Do language models encode deep syntactic structure or only surface-level patterns?
- Why does LLM compression eliminate causal grounding in conceptual representations?
- Can encoder models match human conceptual structure better than larger language models?
- Why do explicit discourse connectives work when implicit relations fail?
- What distinguishes surface generalizations from true linguistic generalizations?
- Why do surface generalizations fail on unusual syntactic structures?
- Does DPO training with coreference chains teach spontaneous convention formation?
- Do LLMs learn linguistic generalizations or just surface-level frequency patterns?
- Can formal language pretraining address surface generalization without learning true linguistic structure?
- Do LLMs learn surface patterns instead of genuine linguistic structure?
- Why do explicit linguistic markers override semantic computation in models?
- Can functional behavior alone capture what makes something a genuine belief?
- How does bidirectional entailment distinguish semantic equivalence from token similarity?
- Why do language models reproduce human EPA structure despite different architecture?
- Can functional semantic grounding substitute for true causal grounding?
- What does embodiment and precariousness mean for linguistic agency?
- How does Wittgenstein's language games explain social grounding in LLMs?
- Do speech encoders actually learn the physics of how vocal tracts produce sound?
- How do language models transmit traits through semantically unrelated data?
- Why does conceptual priming alone fail to produce consciousness claims?
- How does interleaving reasoning with action prevent hallucination in language models?
- Can a virtual instance be individuated from its conversational context?
- What makes the Extended Mind thesis incompatible with internalism?
- What distinguishes real understanding from superficial pattern matching?
- What role does the biological substrate play in human relational identity?
- Why does joint attention matter for acquiring linguistic meaning?
- Can statistical learning from text replace embodied cultural experience?
- Can language models learn internal world models without explicit environment specifications?
- Are static embeddings analogous to the formal linguistic competence layer?
- How do static embeddings and contextualized representations divide semantic labor?
- Can LLMs reason through semantics without understanding causal mechanisms?
- Do distributed relational tasks consistently underperform local classification across NLP domains?
- How do pretrained language models represent inferential patterns versus lexical and positional cues?
- How does subject-predicate distinction emerge from formal linguistic analysis?
- Why does the Chinese Room argument miss the deeper abstraction problem?
- Can external actions provide causal necessity that language models lack?
- How do language models infer their own mental states like humans do?
- Can pragmatic competence emerge from text exposure alone without interactive grounding?
- What are the scaling law differences between vision and language learning?
- Why does gradient descent discover compositional structure without explicit pressure?
- Does language convey meaning purely through relational structure without external grounding?
- What makes human language fundamentally different from what language models produce?
- How does co-occurrence statistics alone produce hierarchical concept organization?
- Do language models need words to think or just latent structure?
- Can readers detect meaning through resonance patterns alone without knowing authorial intent?
- Where does the meaning actually originate in reader-detected resonance across language?
- How does scaling and training data enable compositional behavior without symbolic mechanisms?
- How should we rethink the symbolism versus connectionism debate in light of LLMs?
- How do semantic features in representations become steerable task-specific directions?
- Why do language models need external temporal signals at all?
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Computational structuralism: Toward a formal theory of meaning in the age of digital intelligence
- Mechanistic Indicators of Understanding in Large Language Models
- From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning
- Semantic Structure in Large Language Model Embeddings
- CoT is Not True Reasoning, It Is Just a Tight Constraint to Imitate: A Theory Perspective
- Probing Structured Semantics Understanding and Generation of Language Models via Question Answering
- Language Models’ Hall of Mirrors Problem: Why AI Alignment Requires Peircean Semiosis
- From Human to Machine Psychology: A Conceptual Framework for Understanding Well-Being in Large Language Models
Original note title
LLMs operationalize Saussures langue — fully relational models with no external referents suffice to generate contextually appropriate discourse