INQUIRING LINE

Can recurrent transformers learn genuinely new computations beyond inference stages?

This explores whether looping a transformer's layers actually produces new kinds of computation, or just re-runs the same processing steps it already does in a single pass — and what the corpus says about when recurrence buys real reasoning power.


This explores whether recurrent transformers — models that loop their layers repeatedly instead of stacking more of them — invent genuinely new operations, or just replay the inference steps a normal feedforward pass already performs. The sharpest answer in the corpus is sobering: when researchers looked inside looped models mechanistically, they found each recurrent cycle converging to a stable fixed point that mirrors and repeats the same stages a feedforward model runs, rather than computing anything new How do looped transformer layers actually behave during inference?. By that reading, the loop is a depth-amplifier, not a new kind of thinking.

But "genuinely new" depends on what you measure. Looped models with shared parameters across iterations achieve compositional generalization and depth extrapolation that vanilla transformers simply cannot — generalizing to unseen combinations of knowledge through a sharp three-phase grokking process (memorize, then in-distribution, then out-of-distribution) Can looped transformers generalize to unseen knowledge combinations?. So even if recurrence re-enacts familiar stages step-by-step, chaining those stages dynamically lets the model reach answers a fixed-depth network is provably barred from. The Hierarchical Reasoning Model pushes this further: by coupling slow abstract planning with fast detailed computation across two timescales, a 27M-parameter recurrent model solves Sudoku and mazes where chain-of-thought collapses — explicitly escaping the AC0/TC0 complexity ceiling that caps fixed-depth transformers Can recurrent hierarchies achieve reasoning that transformers cannot?. That looks less like replay and more like reaching computations the base architecture can't.

The deeper question hiding here is whether transformers ever learn systematic *rules* or only memorize the shape of computations they've seen. There's strong evidence for the pessimistic view: transformers tend to reduce compositional reasoning to linearized subgraph matching — stitching together computation paths memorized from training, then failing badly on novel compositions Do transformers actually learn systematic compositional reasoning?. If that's the baseline, then recurrence faces a hard ceiling — looping a pattern-matcher more times doesn't make it a rule-follower. Yet there's a Turing-completeness result showing a single finite transformer *can* in principle compute any computable function given the right prompt — the capacity is there; standard training just rarely finds it Can a single transformer become universally programmable through prompts?.

That gap between latent capacity and what training actually produces is where the most interesting work lives — and it suggests the answer isn't fixed by architecture alone. Self-improving transformers achieve *exponential* length generalization (10-digit to 100-digit addition) purely by generating solutions, filtering for correct ones, and retraining Can transformers improve exponentially by learning from their own correct solutions?. RL-finetuned models develop genuine in-context reinforcement learning, solving unseen problems within an episode with no weight updates at all Can transformers learn to solve new problems within episodes?. And feedback loops that let a model attend to its own latents grow emergent working memory for unbounded inputs without adding a single weight Can models learn working memory by attending to their own latents?. The pattern across these: the new computation often comes not from the recurrence itself but from what the loop lets the model *accumulate* — state, memory, self-generated training signal.

So the honest synthesis is a split decision. Mechanistically, a loop re-enacts known inference stages How do looped transformer layers actually behave during inference?. Behaviorally, recurrence breaks complexity barriers and unlocks compositional generalization that fixed depth cannot reach Can recurrent hierarchies achieve reasoning that transformers cannot? Can looped transformers generalize to unseen knowledge combinations?. The resolution is that "new computation" lives less in any single re-enacted step and more in the dynamic depth, accumulated state, and self-improvement loops that recurrence makes possible — which is why the same architecture looks like a copy machine under a microscope and a reasoning breakthrough on a benchmark.


Sources 8 notes

How do looped transformer layers actually behave during inference?

Mechanistic analysis reveals looped models converge each recurrent cycle to distinct fixed points, with attention behavior stabilizing across iterations. Recurrent blocks learn to mirror and repeat the same inference stages as feedforward models rather than compute genuinely new operations.

Can looped transformers generalize to unseen knowledge combinations?

Recurrent-depth transformers with shared parameters across iterations enable systematic generalization and depth extrapolation that vanilla transformers cannot achieve. This emerges through a sharp three-phase process: memorization, in-distribution, then out-of-distribution generalization.

Can recurrent hierarchies achieve reasoning that transformers cannot?

The Hierarchical Reasoning Model couples slow abstract planning with fast detailed computation across two timescales, achieving near-perfect performance on Sudoku and mazes where chain-of-thought methods fail completely. With only 27M parameters and 1,000 samples, HRM escapes the AC0/TC0 complexity ceiling that constrains fixed-depth transformers.

Do transformers actually learn systematic compositional reasoning?

Research shows transformers succeed on in-distribution tasks by memorizing computation subgraphs from training data, not by learning systematic rules. They fail drastically on novel compositions, with errors compounding across reasoning steps.

Can a single transformer become universally programmable through prompts?

Research proves a single finite-size transformer exists that can compute any computable function given the right prompt, achieving complexity bounds nearly matching unbounded models. However, standard training rarely produces models that learn to implement arbitrary programs this way.

Can transformers improve exponentially by learning from their own correct solutions?

Standard transformers generalize from 10-digit to 100-digit addition by repeatedly generating solutions, filtering for correctness, and retraining—showing exponential (not linear) out-of-distribution improvement across rounds without saturation.

Can transformers learn to solve new problems within episodes?

Llama 3.1 8B fine-tuned with RL exhibits emergent in-context reinforcement learning, solving unseen problems through within-episode adaptation at human-level sample efficiency. This meta-learning emerges from RL's training pressure combined with the transformer's context window, without weight updates.

Can models learn working memory by attending to their own latents?

TransformerFAM demonstrates that adding a feedback loop lets transformers attend to their own latent representations, fostering emergent working memory for indefinitely long inputs. The approach requires no additional weights and improves long-context performance at 1B, 8B, and 24B scales.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a capability analyst re-testing whether recurrent transformers learn genuinely new computations or merely replay feedforward inference stages. The question remains open.

What a curated library found — and when (dated claims, not current truth): Findings span 2023–2026.
• Mechanistic analysis: looped transformers converge to fixed points mirroring feedforward stages, re-enacting rather than inventing (2026).
• Behavioral escape: recurrent-depth models achieve compositional generalization and out-of-distribution grokking that fixed-depth networks provably cannot (2026).
• Complexity ceiling break: a 27M hierarchical dual-recurrence model solves Sudoku/mazes by escaping AC0/TC0 limits where chain-of-thought fails (2026).
• Baseline pessimism: transformers reduce compositional reasoning to linearized subgraph matching — memorizing paths, not learning rules (2024).
• Self-improvement path: models retraining on filtered correct solutions achieve exponential length generalization (10→100 digit addition) (2025).
• In-context RL: transformers develop genuine episode-level problem-solving without weight updates (2025).
• Accumulated state: feedback loops to a model's own latents grow working memory for unbounded inputs (2024).

Anchor papers (verify; mind their dates):
• arXiv:2604.11791 — Mechanistic Analysis of Looped Reasoning Language Models (2026)
• arXiv:2506.21734 — Hierarchical Reasoning Model (2026)
• arXiv:2502.01612 — Self-Improving Transformers (2025)
• arXiv:2404.09173 — TransformerFAM: Feedback Attention as Working Memory (2024)

Your task:
(1) RE-TEST EACH CONSTRAINT. The mechanistic finding (loops re-enact stages) is striking but narrow — does it hold for larger models, longer horizons, or after post-training (RL, DPO, self-play)? Does the hierarchical model's Sudoku success replicate on other reasoning tasks, or was it task-specific? Does self-improvement still require retraining, or can in-context learning now match it? Separate: the durable question (whether recurrence invents rules or amplifies depth) from perishable claims (specific fixed-point structure, specific complexity barriers).
(2) Surface the strongest CONTRADICTING work from the last ~6 months. Does recent mechanistic interpretability on in-context RL or emergent working memory undermine or sharpen the "re-enactment" thesis? Any papers showing genuinely novel algorithmic steps inside loops, or demonstrating in-context learning scales to replace self-improvement?
(3) Propose 2 research questions that assume the regime has moved: (a) If recurrence does amplify depth rather than invent algorithms, can we identify the *minimal* recurrence depth needed for each complexity class (e.g., counting, sorting, reasoning)? (b) Does in-context learning (RL, feedback loops) now obviate the need for recurrence entirely, or do they synergize?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Next inquiring lines