INQUIRING LINE

Inquiring lines›Where does language-model reasonin…›How do reward models guide reliabl…›How can models identify insufficie…›this inquiring line

Can an AI notice when a question is broken — or does it just confidently answer the unanswerable?

Can AI systems identify important unanswered questions that emerge during reasoning?

This explores whether AI can recognize the gaps and open questions inside its own reasoning — knowing what it doesn't know and what it would need to ask — rather than barreling ahead to an answer.

This explores whether AI can recognize the gaps and open questions inside its own reasoning, not just solve problems handed to it cleanly. The corpus's sharpest finding is that these are two different skills. A model can ace a fully-specified problem and still fail to notice when a problem is broken: when one variable is quietly withheld, accuracy on identifying the right clarifying question to ask drops to 40-50%, because information-gathering and problem-execution turn out to be separable cognitive operations Can models identify what information they actually need?. So being good at answering does not make a model good at noticing what's unanswered.

Worse, reasoning-tuned models actively struggle here. Faced with questions that have missing premises, they don't flag them — they generate long, redundant chains of 'reasoning' toward an answer that can't exist, because training rewards producing reasoning steps but never teaches a model when to disengage Why do reasoning models overthink ill-posed questions?. The instinct to recognize 'this question is ill-posed, I should stop and ask' is precisely what optimization erodes.

The encouraging counter-current is that this recognition is learnable, even if it's fragile. Reinforcement learning pushed proactive critical-thinking accuracy on deliberately flawed math problems from essentially zero (0.15%) to 74% — and revealingly, just thinking longer at inference time made untrained models worse at spotting the flaw, while helping trained ones Can models learn to ask clarifying questions instead of guessing?. A gentler route is social meta-learning: models trained only on complete problems can still generalize to underspecified ones, learning to delay answering and treat conversation itself as a place to go get the missing piece Can models learn to ask clarifying questions without explicit training?. The capability isn't absent in the architecture — it just isn't selected for by default.

What the corpus reframes nicely is the difference between an internal gap and an externally-resolvable one. Conversation-analysis work formalizes 'insert-expansions' — the moments where an agent should pause to clarify intent or scope before acting, instead of silently chaining tools toward a misread goal When should AI agents ask users instead of just searching?. That's a structured theory of which unanswered questions are worth surfacing to a human versus resolving alone. And if you want to know where in a reasoning trace such pivots even live, 'thought anchors' research finds that planning and backtracking sentences are the disproportionately influential steering points Which sentences actually steer a reasoning trace? — suggesting that the machinery for 'wait, reconsider' already exists inside traces, even when models don't deploy it to flag genuine gaps.

The thing you may not have expected: the bottleneck here is not intelligence but disposition. Models can be smart enough to solve and still trained out of the humility to notice what's missing — and the fix is less about bigger reasoning and more about teaching a model when to stop reasoning and ask.

Sources 6 notes

Can models identify what information they actually need?

Models achieving high accuracy on complete reasoning tasks drop to 40-50% accuracy identifying what clarifying question to ask when one variable is withheld. Information gathering and problem execution are separable cognitive operations.

Why do reasoning models overthink ill-posed questions?

Reasoning models generate redundant, lengthy responses to questions with missing premises while non-reasoning models correctly identify them as unanswerable. Training optimizes for producing reasoning steps but never teaches models when to disengage.

Can models learn to ask clarifying questions instead of guessing?

Reinforcement learning training increased proactive critical thinking accuracy from 0.15% to 73.98% on deliberately flawed math problems. Notably, inference-time scaling degraded this ability in untrained models but improved it after RL training, suggesting the capability is learnable but fragile without explicit training.

Can models learn to ask clarifying questions without explicit training?

Models trained via SML on complete problems generalize to underspecified tasks by asking for needed information and delaying answers. The training paradigm instills a meta-strategy of using conversation as an information source, addressing the premature-answering failure mode.

When should AI agents ask users instead of just searching?

Tool-enabled LLMs drift from user intent through silent tool chaining. Conversation analysis reveals insert-expansions—clarifying intent, scoping responses, enhancing appeal—as a formal framework for proactive user consultation that prevents misunderstanding instead of recovering from it.

Show all 6 sources

Which sentences actually steer a reasoning trace?

Counterfactual resampling, attention analysis, and causal suppression all identify planning and backtracking sentences as thought anchors—sparse critical points that guide subsequent reasoning. These are functional pivots, not noise.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst. The precise, still-open question: Can AI systems identify important unanswered questions that emerge during reasoning — and can they choose to pause and ask rather than hallucinate answers?

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026. A library of ~15 papers on reasoning, critical thinking, and question-asking surfaces these constraints:

• Problem-solving skill ≠ gap-spotting skill: models drop from high accuracy on well-specified tasks to 40–50% accuracy identifying missing premises (2025).
• Reasoning-tuned models actively fail here: extended chain-of-thought on broken questions generates spurious reasoning chains rather than flagging ill-posedness; training rewards steps, not disengagement (2025).
• Longer inference-time reasoning hurts untrained models at flaw-spotting but helps trained ones — suggesting the capability is learnable, not architectural (2025).
• RL + explicit critical-thinking training raises proactive question-asking from ~0.15% to 74% on underspecified math (2025–2026).
• Social meta-learning (training only on complete problems) generalizes to underspecified ones, producing emergent clarifying-question behavior (2026).

Anchor papers (verify; mind their dates):
• arXiv:2307.01644 (2023) — Insert-expansions: formal theory of when agents should pause to clarify.
• arXiv:2503.22674 (2025) — QuestBench: benchmarking LLM ability to ask clarifying questions.
• arXiv:2507.23407 (2025) — Proactive critical thinking: RL path to question-asking.
• arXiv:2602.16488 (2026) — Social meta-learning: generalization without explicit task-mismatch data.

Your task:
(1) RE-TEST EACH CONSTRAINT. For models (Llama 3.1+, GPT-4o, o1-preview) post-June 2025, does the 40–50% accuracy ceiling on missing-premise detection still hold? Have RL-based critical-thinking methods been integrated into production systems? Has chain-of-thought length still hurt newer untrained baselines, or have architectural changes (e.g., native planning layers, token-level interpretability) relaxed this? Separate the durable question (do models spontaneously pause?) from the perishable limitation (RL training required?).
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from late 2025–2026. The path includes papers (e.g., arXiv:2505.00127, arXiv:2504.09762, arXiv:2506.09250) that directly challenge reasoning-trace interpretation and the anthropomorphization of intermediate tokens. Do any argue that "question-asking" is a surface artifact, not a capability? If so, how does that reframe the feasibility of (1)?
(3) Propose 2 research questions that ASSUME the regime may have moved: (A) If newer models with native uncertainty quantification can now flag missing information without RL, what predicts which unanswered questions they spontaneously surface versus suppress? (B) Can multi-agent orchestration (e.g., one agent spotting gaps, another solving) outperform single-agent self-questioning on reasoning tasks with intentionally hidden premises?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Can an AI notice when a question is broken — or does it just confidently answer the unanswerable?

Related lines of inquiry

Sources 6 notes

Papers this line draws on 8