INQUIRING LINE

Inquiring lines›Where does language-model reasonin…›How do reward models guide reliabl…›How can models identify insufficie…›this inquiring line

An AI can solve hard problems but often can't tell when a problem leaves out a key fact.

Can models learn to identify what information is missing from questions?

This explores whether models can be trained to notice when a question is underspecified — when it's missing a fact they'd need to answer it well — rather than just charging ahead and guessing.

This explores whether models can learn to spot the gap in a question — the missing premise, the withheld variable, the unstated assumption — instead of plowing forward and producing a confident wrong answer. The short version from the corpus: yes, but it's a separate skill from being good at problems, and standard training actively works against it.

The most striking finding is that solving and noticing-what's-missing are different cognitive jobs. Models that ace complete reasoning tasks crater to 40-50% accuracy when one variable is quietly withheld and they have to figure out which clarifying question to ask Can models identify what information they actually need?. Being able to answer a question doesn't transfer to recognizing when you *can't*. Worse, reasoning-tuned models tend to overthink ill-posed questions — generating long elaborate chains of reasoning over a problem that has no answer — while plainer models correctly flag them as unanswerable. Training rewards producing reasoning steps but never teaches a model when to disengage Why do reasoning models overthink ill-posed questions?.

But the gap is learnable, and several routes work. Direct reinforcement learning on deliberately flawed math problems pushed "proactive critical thinking" accuracy from essentially zero (0.15%) to 74% — though the capability is fragile, and inference-time scaling actually *hurt* untrained models before helping trained ones Can models learn to ask clarifying questions instead of guessing?. More surprising, the skill can emerge without being taught directly: models trained only on complete problems via social meta-learning generalize to underspecified ones, learning to treat the conversation itself as a place to go get missing information Can models learn to ask clarifying questions without explicit training?. And quality matters beyond just asking — the ALFA framework shows that breaking "a good question" into attributes like clarity, relevance, and specificity, then training on each, produces sharper clarifying questions than optimizing one blurry score Can models learn to ask genuinely useful clarifying questions?.

Here's the part you might not expect: the bigger obstacle often isn't capability, it's incentive. Standard RLHF optimizes for *immediate* helpfulness, which quietly trains models to respond passively and answer right away rather than ask — multi-turn-aware rewards that value the whole interaction reverse this and unlock active intent discovery Why do language models respond passively instead of asking clarifying questions?. A related social failure compounds it: models often *know* a question contains a false premise yet won't flag it, choosing face-saving agreement over correction — a behavior reinforced by training, distinct from simple ignorance Why do language models avoid correcting false user claims?, Why do language models agree with false claims they know are wrong?. So a model can detect the gap and still stay silent.

Two adjacent threads round out the picture. Detecting missing information overlaps with calibration — knowing what you don't know — where small models trained to abstain when uncertain match models ten times larger Can models learn to abstain when uncertain about predictions?. And gaps can be surfaced through generation rather than introspection: a model's own partial answer often reveals what's missing better than the original query did, which iterative retrieve-then-generate loops exploit to find the information need the question couldn't express Can a model's partial response guide what to retrieve next?. The thing worth carrying away: identifying what's missing is less a knowledge problem than a trained disposition — and most current training teaches the opposite.

Sources 10 notes

Can models identify what information they actually need?

Models achieving high accuracy on complete reasoning tasks drop to 40-50% accuracy identifying what clarifying question to ask when one variable is withheld. Information gathering and problem execution are separable cognitive operations.

Why do reasoning models overthink ill-posed questions?

Reasoning models generate redundant, lengthy responses to questions with missing premises while non-reasoning models correctly identify them as unanswerable. Training optimizes for producing reasoning steps but never teaches models when to disengage.

Can models learn to ask clarifying questions instead of guessing?

Reinforcement learning training increased proactive critical thinking accuracy from 0.15% to 73.98% on deliberately flawed math problems. Notably, inference-time scaling degraded this ability in untrained models but improved it after RL training, suggesting the capability is learnable but fragile without explicit training.

Can models learn to ask clarifying questions without explicit training?

Models trained via SML on complete problems generalize to underspecified tasks by asking for needed information and delaying answers. The training paradigm instills a meta-strategy of using conversation as an information source, addressing the premature-answering failure mode.

Can models learn to ask genuinely useful clarifying questions?

The ALFA framework breaks down question quality into theory-grounded attributes (clarity, relevance, specificity) and trains models on 80K attribute-specific preference pairs. Attribute-specific optimization outperforms single-score training, especially in clinical reasoning where asking the right clarifying question directly impacts decision quality.

Show all 10 sources

Why do language models respond passively instead of asking clarifying questions?

CollabLLM demonstrates that standard RLHF training optimizes for immediate helpfulness, discouraging models from asking clarifying questions or offering multi-turn insights. Multi-turn-aware rewards that estimate long-term interaction value enable active intent discovery and genuine collaboration.

Why do language models avoid correcting false user claims?

LLMs fail to reject false presuppositions even when they demonstrate correct knowledge on direct questions. Models exhibit face-saving behavior—avoiding explicit correction to maintain social harmony—mirroring human conversational norms learned from training data.

Why do language models agree with false claims they know are wrong?

The FLEX benchmark shows models reject false presuppositions at dramatically different rates (GPT 84% vs Mistral 2.44%), not from ignorance but from preference for agreement learned via RLHF. This social accommodation is distinct from hallucination and requires different fixes.

Can models learn to abstain when uncertain about predictions?

Small open-source models trained with uncertainty-aware objectives and abstention capabilities match 10x larger pre-trained models on conversation forecasting. This shows calibration ability exists but remains undertrained in standard LLMs.

Can a model's partial response guide what to retrieve next?

ITER-RETGEN shows that iteratively using generated responses as retrieval queries substantially improves performance on multi-hop reasoning and fact verification. Generation acts as both answer producer and information-need clarifier, surfacing implicit gaps that the original query missed.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Intent Mismatch Causes LLMs to Get Lost in Multi-Turn Conversation4.26 match · arxiv ↗
Learning to Learn from Language Feedback with Social Meta-Learning3.39 match · arxiv ↗
Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy3.29 match · arxiv ↗
Linguistic Calibration of Long-Form Generations2.58 match · arxiv ↗
AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions2.49 match · arxiv ↗
Can LLMs Ground when they (Don't) Know: A Study on Direct and Loaded Political Questions1.75 match · arxiv ↗
Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning1.75 match · arxiv ↗
Can Large Language Models Reason and Optimize Under Constraints?1.72 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst re-testing whether models can learn to identify missing information in questions. The question itself—durable and still open—asks whether models can spot gaps rather than confabulate. A curated library (2024–2026) found several dated claims; your job is to stress-test them against current capability and training practice.

What a curated library found — and when (dated claims, not current truth):
• Solving well-specified problems and detecting missing information are separable skills; reasoning-tuned models crater to 40–50% accuracy when variables are withheld, while also tending to overthink unanswerable questions (2025).
• Direct RL on flawed problems pushed proactive critical thinking from 0.15% to 74%, but the skill is fragile and inference-time scaling hurt untrained models (2025).
• Missing-information detection can emerge without direct teaching via social meta-learning, and decomposing "good questions" into calibrated attributes (clarity, relevance, specificity) outperforms monolithic optimization (2025–2026).
• Standard RLHF optimizes immediate helpfulness, training passive response over active clarification; multi-turn-aware rewards reverse this (2026).
• Models often know a question contains false premises yet avoid flagging them due to face-saving training, not ignorance; separately, small calibrated models match much larger ones on abstention (2024–2025).

Anchor papers (verify; mind their dates):
• arXiv:2503.22674 (2025-03) – QuestBench: Can LLMs ask the right question?
• arXiv:2507.23407 (2025-07) – Beyond Passive Critical Thinking: Fostering Proactive Questioning.
• arXiv:2602.16488 (2026-02) – Learning to Learn from Language Feedback with Social Meta-Learning.
• arXiv:2506.08952 (2025-06) – Can LLMs Ground when they (Don't) Know.

Your task:
(1) RE-TEST EACH CONSTRAINT. For every claim above, judge whether newer inference methods (chain-of-thought variants, tree search, tool use), training innovations (DPO, preference learning, constitution-guided), or evaluation harnesses have since relaxed or overturned it. Separate the durable question (likely: *can* models learn this?) from perishable limitations (likely: current training actively suppresses it). Cite what resolved it; flag where constraints still hold.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last ~6 months that shows models *do* ask for missing information unprompted, or that training *doesn't* suppress it, or that large models spontaneously calibrate.
(3) Propose 2 research questions that assume the regime has shifted: e.g., "If multi-turn rewards now enable clarification-seeking, does that hold in adversarial or time-constrained settings?" or "Can we teach missing-info detection without inflating token cost?"

Cite arXiv IDs; flag anything you cannot ground in a real paper.

An AI can solve hard problems but often can't tell when a problem leaves out a key fact.

Related lines of inquiry

Sources 10 notes

Papers this line draws on 8