Are language models developing real functional competence or just formal competence?

Neuroscience suggests formal linguistic competence (rules and patterns) and functional competence (real-world understanding) rely on different brain mechanisms. Can next-token prediction alone produce both, or does it leave functional competence behind?

Synthesis note · 2026-02-21 · sourced from Philosophy Subjectivity

Fedorenko and colleagues (Dissociating language and thought) ground the LLM competence debate in neuroscience. Formal linguistic competence — knowledge of linguistic rules and patterns, grammatical structure, syntactic regularities — relies on dedicated language circuits in the brain. Functional linguistic competence — understanding and using language in the world — requires integration of diverse brain networks beyond language circuits: memory, reasoning, social cognition, sensorimotor systems.

The critical finding: word-in-context prediction, the training objective of most LLMs, produces formal competence as an emergent outcome. It does not and cannot produce functional competence, because functional competence requires the integration of systems that are architecturally distinct in the brain and not activated by the prediction objective.

LLMs are "qualitatively different in their formal linguistic capacities from models before roughly 2018" — a genuine discontinuity in formal competence. But this formal competence arises from an objective that leaves functional competence behind. The two competences are not on a continuum; they are served by different mechanisms.

The predictive implication is architectural. Models that succeed at real-life language use will need to mimic the division of labor between formal and functional competence in the human brain — through modularity: separate circuits for form-level processing and for world-connected functional processing. LLMs that add retrieval, tool use, and memory may be approximating this modularity, but from the outside rather than by design.

This is distinct from Bender & Koller's claim that meaning cannot be acquired from form alone (which rests on the joint-attention/communicative-intent argument). The Fedorenko finding adds a mechanistic neuroscience foundation: even if we grant that some meaning can emerge from distributional learning, the kind of competence that requires world integration is neurologically segregated and cannot be produced by the same mechanism as syntactic pattern-learning.

Inquiring lines that read this note 11

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

Do language models learn genuine linguistic structure or just surface patterns?

Is embodied interaction necessary for language meaning and genuine agency?

Can next-token prediction alone produce genuine language understanding?

Does next-token prediction alone produce genuine functional language competence?

Do language models perform faithful symbolic reasoning independent of semantic grounding?

Do LLMs have functional linguistic competence or only formal language ability?

Do language models understand semantics or rely on pattern matching?

What's the difference between formal and functional linguistic competence?

How do formal dialogue structures reveal conversation coherence mechanisms?

What distinguishes communicative competence from human-like dialogue ability?

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

19 direct connections · 152 in 2-hop network ·medium cluster Open in graph ↗

Are language models developing real functional c… Can language models learn meaning from text patter… What makes linguistic agency impossible for langua… Why does ChatGPT fail at implicit discourse relati…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Can language models learn meaning from text patterns alone? Explores whether training on form alone—predicting the next word from prior words—could ever give language models access to communicative intent and genuine semantic understanding.
shares the formal/functional gap; Fedorenko adds neuroscience mechanism; Bender/Koller add communicative-intent argument
What makes linguistic agency impossible for language models? From an enactive perspective, does linguistic agency require embodied participation and real stakes that LLMs fundamentally lack? This matters because it challenges whether LLMs can truly engage in language or only generate text.
enactive view is a third account of why functional competence requires more than formal pattern-learning
Why does ChatGPT fail at implicit discourse relations? ChatGPT excels when discourse connectives are present but drops to 24% accuracy without them. What does this gap reveal about how LLMs actually process meaning and logical relationships?
behavioral evidence for formal/functional gap: explicit connectives (formal cues) work; implicit relations (functional understanding) fail

Are language models developing real functional competence or just formal competence?

Inquiring lines that read this note 11

Related concepts in this collection 3

Related papers in this collection 8

Search by related questions 4