SYNTHESIS NOTE

When do language models stop memorizing and start generalizing?

Can we measure the exact capacity limit where models transition from memorizing training data to learning underlying patterns? Understanding this boundary could reshape how we think about model learning and privacy.

Synthesis note · 2026-02-23 · sourced from Memory

The standard approach to measuring memorization — attempting to extract training data from the model — is fundamentally flawed. Language models can be coerced to output almost any string, so generation is not proof of memorization. Conversely, a model may memorize patterns (every other token, structural regularities) without reproducing text verbatim. Extraction is neither necessary nor sufficient.

The formal separation: unintended memorization is the information a model contains about a specific dataset (the bits that would change if a particular example were removed from training). Generalization is the information the model contains about the true data-generation process. By isolating and eliminating the generalization component, total memorization becomes measurable.

The key empirical finding: GPT-family models have an approximate capacity of 3.6 bits-per-parameter for unintended memorization. Models memorize training data until this capacity fills. At that point, a phase transition occurs — grokking begins, and unintended memorization decreases as models begin to generalize.

This reframes the grokking phenomenon mechanistically. Since What happens inside models when they suddenly generalize?, the capacity-filling measurement adds the trigger condition: grokking doesn't begin at an arbitrary training step — it begins when memorization saturates. The three phases are downstream of a capacity constraint, not of training duration per se.

The practical implication: memorization capacity is a measurable property of a specific model, not a property of the training algorithm. Two models trained by the same algorithm on the same data can have different memorization properties. This matters for privacy (which models leak more), for understanding generalization (capacity constrains when it begins), and for the Can AI pass every test while understanding nothing? question — a model that appears to generalize may simply have unfilled memorization capacity.

Inquiring lines that read this note 31

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

How do training priors constrain what context information can override?

How does in-context learning trigger phase transitions in model behavior?

How does reasoning graph topology affect breakthrough insights and generalization?

Do grokking phases correspond to transitions between nesting levels?

How does memorization interact with learning and generalization?

Why do continual learning scenarios trigger catastrophic forgetting and interference?

What limits mechanistic interpretability's ability to characterize models?

Can we detect and measure circuit formation before generalization emerges?

What memory architectures best support persistent reasoning across extended interactions?

Can inference-time compute substitute for scaling up model parameters?

Where does inference compute stop substituting for model capacity?

What role does compression play in language model capability and generalization?

Why does consolidated memory sometimes degrade agent performance?

How does sequence length affect sparsity tolerance in models?

Can sparsity patterns reliably indicate how well a model knows its input?

What pretraining choices and baseline capability constrain reinforcement learning gains?

What capacity threshold determines whether RL teaches activation versus shortcut learning?

Why does finetuning cause catastrophic forgetting of model capabilities?

How does in-weight memorization scale with model parameter count?

Do language models develop causal world models or rely on statistical patterns?

What empirical evidence supports the Learning Law on real language models?

How do knowledge injection methods compare across cost and effectiveness?

When does training a memory model beat RAG or fine-tuning?

What makes weaker teacher models effective for stronger student training?

How does student capacity limit what it can learn from teachers?

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

13 direct connections · 101 in 2-hop network ·medium cluster Open in graph ↗

When do language models stop memorizing and star… What happens inside models when they suddenly gene… Can we predict keyword priming before learning hap… Can we prune training data without hurting model p…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

What happens inside models when they suddenly generalize? Grokking appears as an abrupt shift from memorization to generalization. But is the underlying process truly discontinuous, or does mechanistic analysis reveal continuous phases we can measure and predict?
capacity-filling provides the trigger mechanism for when grokking begins
Can we predict keyword priming before learning happens? Exploring whether the degree to which newly learned keywords contaminate unrelated contexts can be predicted from measurable properties before training begins, and what mechanisms enable this prediction.
a complementary view of how memorization interacts with learning
Can we prune training data without hurting model performance? This explores whether difficulty metrics can identify redundant training examples that can be safely removed. It matters because most datasets contain massive waste — if we can find which examples are truly necessary, we could train better models on far less data.
if memorization has finite capacity, pruning removes low-value items that consume capacity

When do language models stop memorizing and start generalizing?

Inquiring lines that read this note 31

Related concepts in this collection 3

Related papers in this collection 8

Search by related questions 4