SYNTHESIS NOTE

Can models learn to ask clarifying questions without explicit training?

Do language models trained only on fully-specified problems spontaneously develop the ability to ask for missing information when facing underspecified tasks? This tests whether conversational problem-solving strategies emerge from meta-learning rather than direct instruction.

Synthesis note · 2026-05-18 · sourced from Training Fine Tuning

Social meta-learning (SML) training yields a surprising generalization result. The training procedure uses only fully-specified problems: the student receives the complete problem statement in the first turn, and the teacher provides feedback as the student attempts to solve it. None of the training problems require the student to handle missing information. Yet the trained model performs significantly better on underspecified tasks at test time, where critical information is revealed only across multiple conversational turns.
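To make the train/test contrast concrete, here is a minimal sketch of how the same problem could be presented in both forms. The `Problem` class, the function names, and the one-fact-per-turn reveal schedule are illustrative assumptions, not the construction used in the underlying work.

```python
from dataclasses import dataclass


@dataclass
class Problem:
    """Hypothetical container: a question plus the facts needed to answer it."""
    question: str
    facts: list[str]


def fully_specified_turns(problem: Problem) -> list[str]:
    """Training-style episode: every fact arrives in the first user turn."""
    return [" ".join(problem.facts + [problem.question])]


def underspecified_turns(problem: Problem) -> list[str]:
    """Test-style episode (assumed schedule): question first, then one fact per turn."""
    return [problem.question] + list(problem.facts)


problem = Problem(
    question="How long does the trip take?",
    facts=["The distance is 120 km.", "The average speed is 60 km/h."],
)
train_episode = fully_specified_turns(problem)
test_episode = underspecified_turns(problem)
```

In this sketch the training episode is a single user turn, while the test episode spreads the same information over three turns, so a model that answers after the first turn is necessarily guessing.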

The behavioral signature is specific: SML-trained models make fewer premature answer attempts and are more likely to ask for the information they need. They learn to recognize when they lack enough information to answer well and to extract that information from the conversation partner. This is the human pattern of "ask before answering when you're not sure" — emerging in an LLM that was never explicitly trained on the pattern.
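The two behaviors named above could be measured along the following lines. This is a rough sketch under stated assumptions: the transcript format, the `classify_turn` heuristic, and the metric names are hypothetical, and a real evaluation would likely use a stronger turn classifier than string matching.

```python
def classify_turn(text: str) -> str:
    """Crude heuristic label for an assistant turn (assumption, not the source's method)."""
    stripped = text.strip()
    if stripped.endswith("?"):
        return "question"
    if stripped.lower().startswith("answer:"):
        return "answer"
    return "other"


def behavior_rates(dialogues):
    """Compute per-episode rates of premature answers and clarifying questions.

    dialogues: list of episodes; each episode is a list of
    (assistant_text, info_complete) pairs, where info_complete says whether
    all needed facts had been revealed before that assistant turn.
    """
    if not dialogues:
        raise ValueError("need at least one dialogue")
    premature = 0
    asked = 0
    for episode in dialogues:
        labels = [(classify_turn(text), complete) for text, complete in episode]
        # Premature answer: an answer attempt made while information was missing.
        if any(label == "answer" and not complete for label, complete in labels):
            premature += 1
        # Clarifying question: a question asked while information was missing.
        if any(label == "question" and not complete for label, complete in labels):
            asked += 1
    n = len(dialogues)
    return {"premature_answer_rate": premature / n, "question_rate": asked / n}


dialogues = [
    [("Answer: 2 hours", False)],
    [("What is the average speed?", False), ("Answer: 2 hours", True)],
]
rates = behavior_rates(dialogues)
```

Here the first episode counts as a premature answer and the second as a clarifying question followed by a well-informed answer, so both rates come out to 0.5.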

The mechanism appears to be that SML training teaches the model a meta-strategy: use the conversation as a resource. This strategy generalizes from "use the conversation to refine an answer to a fully-specified problem" (training distribution) to "use the conversation to get missing information first, then answer" (test distribution). The student has learned not just to solicit corrective feedback but to model the conversation as a place where information flows.

The result can be strengthened with a two-stage procedure that adds Q-priming before SML. Q-priming is a preliminary SFT stage that trains the model on dialogues in which it was explicitly prompted to ask questions, leveraging the teacher's private knowledge to generate good question examples. Online RL via SML then refines the behavior further. The combined pipeline produces stronger clarifying-question behavior than either stage alone.

For conversational AI design, this is an existence proof: the structural skill of "ask before answering" can be installed via training rather than via runtime prompting. Systems that have struggled with the "LLM answers prematurely" failure mode can address it at the training level rather than relying on prompt engineering.

Inquiring lines that read this note (36)

This note is a source for the research framings listed below.

How can models identify insufficient information and respond appropriately without guessing?

Why do LLM chatbots fail as independent therapeutic agents?

Why can't language models conduct genuine Socratic questioning in therapy sessions?

Why do language models reinforce false assumptions instead of correcting them?

What makes dialogue-based explanation more successful than monologue?

What makes specific clarifying questions more effective than generic ones?

How can language models sustain linguistic synchrony and intersubjectivity during dialogue?

What capability tradeoffs emerge when scaling model reasoning abilities?

Can models learn when to think versus answer directly?

Why do reasoning models fail at systematic problem-solving and search?

Why do models overthink underspecified problems instead of rejecting them?

Does reinforcement learning teach reasoning or just when to reason?

Can reinforcement learning teach AI when to ask clarifying questions?

How do training data properties shape reasoning capability development?

Can models be trained to explain instead of imitate answers?

Does RLHF training sacrifice accuracy and grounding for user agreement?

How does RLHF training reward models for guessing over asking clarifying questions?

How do training priors constrain what context information can override?

Can Q-priming further strengthen clarifying question behavior beyond social meta-learning alone?

How should dialogue recommender systems manage conversation history and state?

How does treating conversation as a resource change what models learn to do?

How do self-generated feedback mechanisms enable effective model learning?

What emergent behaviors do models develop when trained on underspecified pedagogical tasks?

