Can we transfer reasoning structure without copying surface form?
This explores whether the *logical scaffolding* of reasoning — the moves that get you from problem to answer — can be carried across tasks or models independently of the exact words, format, and stylistic dressing it arrived in.
This explores whether reasoning's underlying structure can travel separately from its surface form — its specific wording, length, and format. The corpus has a sobering starting point: a cluster of work argues that today's chain-of-thought may be *nothing but* surface form. Several notes converge on the claim that CoT reproduces the shape of reasoning through learned pattern-matching rather than genuine inference Does chain-of-thought reasoning reveal genuine inference or pattern matching? What makes chain-of-thought reasoning actually work? Why does chain-of-thought reasoning fail in predictable ways?. The evidence is that performance degrades predictably the moment you shift task, length, or format Does chain-of-thought reasoning actually generalize beyond training data?, and that format and spatial layout shape the answer roughly 7.5× more than the actual domain content — invalid prompts even work as well as valid ones What makes chain-of-thought reasoning actually work?. If structure and surface are this entangled, transferring one without the other looks hard.
But the same corpus suggests the two *are* separable, and shows a few ways to pry them apart. The most striking is the finding that verbose and concise reasoning live in distinct, linear regions of a model's activation space — meaning "how much surface form" is a single steerable direction you can dial down by 67% without losing accuracy Can we steer reasoning toward brevity without retraining?. That's almost a literal demonstration of the question: the structure survives while the surface is compressed away.
The more interesting transfer stories work by extracting structure into a form that *isn't* prose at all. Reconstructing the hidden thought processes behind expert texts — the self-talk, recall, and verification that the polished writing left out — produces reasoning skills that genuinely transfer across domains, beating standard pretraining by up to 8 points on hard problems Can reconstructing expert thinking improve reasoning transfer?. The framing there is sharp: expert writing is the *surface residue* of a thinking process, and recovering the process is what transfers. Similarly, externalizing reasoning into knowledge-graph triples lets a small model punch far above its weight on hard tasks Can structuring reasoning as knowledge graphs help smaller models solve complex tasks?, and training abstraction generators separately from solution generators turns "structure" into a reusable object that guides breadth-first exploration Can abstractions guide exploration better than depth alone?. In each case the trick is the same: name the structure explicitly so it stops being trapped inside one task's wording.
A third angle questions whether prose was ever the right container. Diffusion LLMs decouple reasoning from answering into separate refinement axes, so the two no longer have to share a single left-to-right surface form Can reasoning and answers be generated separately in language models?. Energy-based transformers locate reasoning in iterative energy minimization rather than in generated text, and generalize better out-of-distribution without domain-specific scaffolding Can energy minimization unlock reasoning without domain-specific training? — which is exactly the symptom you'd expect if the structure had detached from surface idiosyncrasies. And grounding reasoning in external action rather than internal narration prevents the error propagation that pure CoT suffers Can interleaving reasoning with real-world feedback prevent hallucination?.
The quiet payoff: the reason surface-bound CoT fails to transfer and the reason these methods succeed are the same reason. Distribution-bounded CoT *is* surface form with no separable structure underneath, so it can't generalize. The methods that transfer all do one thing first — they give the reasoning structure an explicit, non-prose representation (a vector, a graph, a reconstructed thought trace, an energy landscape). So the answer is a qualified yes: you can transfer structure without surface form, but only once you've extracted the structure into something that isn't surface form to begin with.
Sources 12 notes
CoT works by constraining models to reproduce familiar reasoning patterns from training, not by enabling novel symbolic reasoning. Performance degrades predictably under distribution shifts—the signature of imitation rather than capability emergence.
CoT systems reproduce the form of reasoning through pattern matching rather than performing genuine logical inference. This explains why format effects dominate content, why structurally invalid prompts succeed, and why stronger reasoning models become less instruction-compliant.
CoT guides models to pattern-match reasoning structure rather than perform genuine inference. This explains distribution-bounded failures, why structural coherence matters more than content correctness, and why performance optimizes against interpretability.
DataAlchemy experiments show CoT fails systematically under distributional shifts in task, length, and format. Models produce fluent but logically inconsistent reasoning — imitating reasoning form without valid underlying logic.
Research shows training format shapes reasoning strategy 7.5× more than domain, demo position swings accuracy 20%, and invalid CoT prompts work as well as valid ones. CoT is pattern-guided generation, not formal logic.
Activation-Steered Compression extracts a single vector from 50 paired examples to reduce chain-of-thought length by 67% while maintaining accuracy and achieving 2.73x speedup. The method is training-free and generalizes across model sizes and domains.
Training on expert texts augmented with reconstructed thought processes (self-talk, knowledge recall, verification) produces reasoning skills that transfer across domains and adapt depth to problem difficulty, outperforming standard continual pretraining by up to 8 points on hard problems.
Knowledge Graph of Thoughts (KGoT) achieves 29% improvement on GAIA Level 3 tasks using GPT-4o mini by externalizing reasoning into iteratively constructed KG triples. The approach improves transparency, reduces bias, and enables quality control over reasoning steps.
RLAD jointly trains abstraction and solution generators, showing that allocating test-time compute to diverse abstractions outperforms parallel solution sampling at large budgets. Abstractions create structured breadth-first exploration that prevents the underthinking failure mode of depth-only reasoning chains.
ICE shows that bidirectional attention in diffusion LLMs enables in-place prompting—embedding reasoning directly in masked positions refined alongside answers. Answer confidence converges early while reasoning continues refining, allowing early-exit mechanisms to cut compute by 50% while maintaining accuracy.
Energy-Based Transformers assign energy values to input-prediction pairs and use gradient descent minimization for inference, yielding 35% higher training scaling rates and 29% more inference-compute gains than Transformer++, while generalizing better on out-of-distribution data without domain-specific scaffolding.
ReAct demonstrates that alternating verbal reasoning with external tool queries (Wikipedia API, environment interaction) prevents error propagation by injecting real-world feedback at each step. On knowledge-intensive and interactive tasks, this approach outperforms pure chain-of-thought and reinforcement learning by 10-34% absolute accuracy.