INQUIRING LINE

Inquiring lines›How should agents manage and coord…›How can training approaches develo…›How do training data properties sh…›this inquiring line

For AI and humans alike, studying worked explanations may teach more than grinding out correct answers from scratch.

Why do students learn better from explanations than from solving problems from scratch?

This reads 'students' as learning models (and the learners they stand in for), asking why engaging with worked-out reasoning or critiques of solutions can beat grinding problems out unaided — and the corpus suggests the answer is about what kind of signal each path actually carries.

This explores why a 'student' — whether a model being trained or a person — often gains more from studying explanations and critiques than from solving problems cold, and the collection has a surprisingly direct answer: engaging with reasoning that's already laid out exposes the *structure* of how an answer is reached, while solving from scratch mostly rewards arriving at a surface-correct result. The sharpest evidence is that training a model to critique noisy, sometimes-wrong responses produces deeper understanding than training it to imitate correct answers — because critique forces engagement with the failure modes, not just the happy path Does critiquing errors teach deeper understanding than imitating correct answers?. Strikingly, this works even from a single problem: showing a model good-versus-bad solutions to one question and asking it to judge them is enough to unlock general reasoning, no trial-and-error reinforcement needed Can a single problem unlock reasoning through solution critique?.

Why would seeing the work beat doing the work? Because the value lives in the *process*, not the final answer. Models trained on full search traces — including the wrong turns, dead ends, and backtracking — end up 25% better than models trained only on clean, optimal solutions, because they learn an internal model of *how to search* rather than memorizing one fixed route Does training on messy search processes improve reasoning?. Solving from scratch, by contrast, tends to produce wandering, unsystematic exploration whose success rate collapses as problems get deeper Why do reasoning LLMs fail at deeper problem solving?. An explanation hands you the systematic path for free.

There's a deeper mechanism underneath this. Reasoning generalizes when it draws on broad, transferable *procedural* knowledge — the 'how to do this kind of thing' — rather than narrow factual recall tied to one document Does procedural knowledge drive reasoning more than factual retrieval?. Explanations are procedural knowledge made explicit; solving from scratch leaves the procedure implicit and often un-learned. The same theme shows up in work where simply extracting the rules latent in a worked example into reusable 'skills' lifts a frozen model's performance with no weight updates at all — the gain is purely from making the method visible Can frozen models learn better by extracting context into skills?.

But the corpus also draws the boundary lines, which is where it gets interesting. Explanations don't help unconditionally. Teacher-refined material actually *hurts* when it sits beyond the student's current frontier — a student has to filter for what it can actually absorb, meaning the best explanation is one calibrated to the learner, not the objectively best one Does teacher-refined data always improve student model performance?. This rhymes with the finding that medium-difficulty problems teach best: too easy carries no signal, too hard amplifies shortcuts, and the productive zone balances success against informative failure Why do medium-difficulty problems teach reasoning better than hard ones?. So 'learning from explanations' isn't a free lunch — it's a Goldilocks effect about matching the explanation to where the learner stands.

Two caveats worth carrying away. First, explanations can be unsettlingly hollow and still work: models trained on deliberately corrupted, semantically-irrelevant reasoning traces perform about as well as those trained on correct ones, hinting that traces sometimes act as computational *scaffolding* rather than meaningful instruction Do reasoning traces need to be semantically correct?. Second, for human learners the danger flips: explanations breed false confidence. Reasoning traces and post-hoc justifications make people accept AI answers whether or not they're right — and only *contrastive* explanations that argue both sides genuinely help a reader tell correct from incorrect Do explanations actually help users spot AI mistakes?. The lesson hiding here is the thing you didn't know you wanted to know: explanations teach best not when they show the right answer, but when they show the *contrast* between right and wrong.

Sources 10 notes

Does critiquing errors teach deeper understanding than imitating correct answers?

Training models to critique noisy responses outperforms training on correct answers because critique forces engagement with failure modes and structural reasoning. Even imperfect critique supervision beats correct-answer imitation, showing how weak surface-pattern learning is for building genuine understanding.

Can a single problem unlock reasoning through solution critique?

Critique Fine-Tuning achieves reasoning activation comparable to RLVR using only one problem and teacher-generated critiques of varied solutions, with no reinforcement learning. This demonstrates that exposure to correct versus incorrect reasoning on a specific problem is the sufficient activation signal.

Does training on messy search processes improve reasoning?

Stream of Search pretraining, which represents exploration and backtracking as serialized strings, achieves 25% higher accuracy than optimal-trajectory-only training. Models learn internal world models for search and adaptive strategies rather than fixed external methods.

Why do reasoning LLMs fail at deeper problem solving?

Current reasoning models lack the three properties of systematic exploration: validity, effectiveness, and necessity. This causes success probability to drop exponentially with problem depth, making medium problems solvable but deep problems catastrophically harder.

Does procedural knowledge drive reasoning more than factual retrieval?

Analysis of 5 million pretraining documents shows reasoning relies on broad, transferable procedural knowledge from diverse sources, unlike factual recall which depends on narrow, document-specific memorization of target facts.

Show all 10 sources

Can frozen models learn better by extracting context into skills?

Extracting natural-language rules from context into reusable skills improves frozen model reasoning without weight updates. On CL-bench, this lifts GPT-4.1 from 11.1% to 16.5%, with skills transferable across model backbones.

Does teacher-refined data always improve student model performance?

Teacher-refined data degrades performance when it exceeds the student's learning frontier, even if objectively higher quality. Students should filter refinements using their own statistical profile to retain only compatible improvements.

Why do medium-difficulty problems teach reasoning better than hard ones?

RLVR learning follows an inverted-U curve across difficulty: medium problems yield strongest gains because they balance success frequency with informative failures, while easy samples lack variance and hard samples amplify shortcuts.

Do reasoning traces need to be semantically correct?

Models trained on systematically irrelevant traces maintain solution accuracy and sometimes improve out-of-distribution generalization, suggesting traces function as computational scaffolding rather than meaningful reasoning steps.

Do explanations actually help users spot AI mistakes?

Reasoning traces and post-hoc explanations increase user acceptance of AI answers regardless of correctness, engendering false trust. Only dual explanations presenting arguments for and against the answer genuinely help users distinguish correct from incorrect outputs.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are an AI researcher auditing claims about why explanations outperform cold problem-solving for student learning (human and model). The question remains open: *what is the durable mechanism, and has recent work shifted the regime?*

What a curated library found — and when (dated claims, not current truth): Findings span 2022–2026; treat each as time-bound:
• Training on critique of noisy/wrong responses produces deeper understanding than imitation of correct answers; critique-fine-tuning on a single problem unlocks reasoning (~2025–2026).
• Models trained on full search traces (including mistakes and backtracking) perform ~25% better than those trained on clean solutions; reasoning LLMs are unsystematic explorers whose success collapses on deeper problems (~2024–2025).
• Procedural knowledge — reusable 'how-to' patterns — drives reasoning generalization more than factual recall; inference-time skill extraction boosts frozen models with zero weight updates (~2025).
• Medium-difficulty samples teach best; too easy/hard both fail; teacher-refined material *hurts* if beyond learner's frontier (~2026).
• Deliberately corrupted reasoning traces perform *as well* as correct ones (scaffolding ≠ semantic content); only contrastive (dual-sided) explanations help humans distinguish right from wrong (~2025–2026).

Anchor papers (verify; mind their dates):
• arXiv:2501.17703 (Critique Fine-Tuning, Jan 2025)
• arXiv:2505.20296 (Wandering Solution Explorers, May 2025)
• arXiv:2411.12580 (Procedural Knowledge, Nov 2024)
• arXiv:2605.28388 (Sample Difficulty Mechanism, May 2026)

Your task:
(1) RE-TEST EACH CONSTRAINT. For critique-based learning, search for models/methods that *bypass* the need for noisy data or contrastive pairs. For procedural knowledge, test whether end-to-end scaling and instruction tuning have since absorbed this insight such that plain imitation now rivals critique. For the Goldilocks difficulty effect, probe whether curriculum learning or adaptive masking has flattened the non-monotonic boundary. Separate the durable insight (contrast beats one-sided framing) from potentially relaxed constraints (need for curated difficulty, learner calibration).
(2) Surface the strongest DISAGREEMENT or SUPERSEDING work from the last ~6 months: look for papers claiming explanations *don't* help, or that end-to-end RL on correctness alone now outpaces all structured-reasoning approaches.
(3) Propose 2 research questions assuming the regime *has* moved: (a) Do multimodal or world-model pretraining change what procedural knowledge means? (b) In agents with long-horizon memory/planning, does cold problem-solving now match or beat explanation-based bootstrapping?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

For AI and humans alike, studying worked explanations may teach more than grinding out correct answers from scratch.

Related lines of inquiry

Sources 10 notes

Papers this line draws on 8