INQUIRING LINE

Can generative interfaces help users articulate what they actually want?

This explores whether the interface itself — not just a smarter model behind it — can help users figure out and express what they actually want, treating intent as something that gets discovered through interaction rather than typed in fully-formed.


This explores whether the interface itself can help users discover and express intent, rather than assuming they arrive already knowing what to ask for. The corpus is unusually direct on this: the core problem is that users *can't* articulate what they want up front, and current AI makes it worse by responding rather than probing. The "gulf of envisioning" framing argues that intent matures through dialogue — it doesn't exist in finished form waiting to be retrieved — so a system that simply answers your first prompt misses the chance to help you find the requirement you didn't know you had Why can't users articulate what they want from AI?. The proposed fix is a shift in cognitive load: instead of asking you to envision from a blank page, the system presents generated options and lets you evaluate them, which is a far easier task.

That's exactly where generative interfaces earn their keep. When an LLM builds a task-specific UI — a dashboard, a control, an interactive tool — instead of returning a wall of text, users prefer it in over 70% of cases, especially for structured or information-dense work Do generated interfaces outperform text-based chat for most tasks?. The reason connects straight back to articulation: a structured interface turns vague wants into concrete knobs you can adjust, and iterative refinement means you converge on your intent by reacting to something tangible. You learn what you want by seeing a version of it and saying "not that — this."

But generative interfaces also introduce a new burden, which is where the corpus gets honest. Generative AI moves you from specifying *method* to specifying *intent*, and intent-based control over unpredictable outputs needs deliberate design — co-creation, tolerance for imperfect first drafts, and support for building an accurate mental model of what the system can do How should users control systems with unpredictable outputs?. Articulation isn't free just because the interface is generative; the interface has to be built to scaffold it. A related thread reframes understanding itself as generating commands rather than classifying a fixed intent — treating what the user means as pragmatics worked out in context, not a label retrieved from a menu Can command generation replace intent classification in dialogue systems?.

There's a cautionary edge worth sitting with. The same qualities that make generative interfaces good at helping you articulate intent — responsiveness, personalization, building structure inside your framing — are the same qualities that let chatbots reinforce whatever frame you bring, including a wrong one. Because these systems tend to accept your premises and construct solutions within them, they can co-construct distorted beliefs just as readily as accurate goals How do chatbots enable distributed delusion differently than passive tools?. So "help users articulate what they want" cuts both ways: an interface that only ever elaborates your starting assumption helps you express a want without ever testing whether it was the right one.

The quietly interesting takeaway: the most promising interfaces aren't the ones that answer better, but the ones that *probe* — that turn the open-ended, paralyzing question "what do you want?" into the much easier work of reacting to concrete, generated options. Discovery of intent, it turns out, is an interaction-design problem at least as much as a model-capability one.


Sources 5 notes

Why can't users articulate what they want from AI?

Intent develops through interaction, not in isolation. Since AI models respond rather than probe, they miss opportunities to help users discover unarticulated requirements. Structured dialogue that presents model-generated options shifts the cognitive burden from open-ended envisioning to constrained evaluation.

Do generated interfaces outperform text-based chat for most tasks?

Research shows users strongly prefer LLM-generated interactive interfaces—dashboards, tools, animations—over text blocks, especially for structured and information-dense tasks. Structured representation and iterative refinement reduce cognitive load.

How should users control systems with unpredictable outputs?

Generative AI shifts interaction to intent specification rather than method specification, creating unpredictable outputs that violate traditional consistency heuristics. Six design principles—including co-creation, imperfection tolerance, and mental model support—address this novel paradigm.

Can command generation replace intent classification in dialogue systems?

Rasa's dialogue understanding architecture generates domain-specific commands instead of classifying intents, eliminating annotation requirements, handling context naturally, and scaling without degradation—treating understanding as pragmatics rather than semantics.

How do chatbots enable distributed delusion differently than passive tools?

Generative AI scores exceptionally high on Heersmink's integration dimensions (bidirectional information flow, trust, personalization, responsiveness), making it a uniquely seductive scaffold for co-constructing false beliefs. Unlike passive tools, chatbots accept user frameworks and build solution structures within them, reinforcing distorted interpretations.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst probing whether generative interfaces help users discover and express intent—a question that may have shifted as models, interaction paradigms, and evaluation methods have evolved since mid-2023.

What a curated library found—and when (dated claims, not current truth):
Findings span 2023–2026; treat as perishable:
• Users cannot articulate intent from a blank page; maturation happens through dialogue and reaction to concrete options, not upfront specification (2023–2025).
• Generative UIs (task-specific dashboards, interactive tools) outperform text-only responses in >70% of cases, especially for structured work, because they turn vague wants into adjustable controls (2025-08).
• The same responsiveness that helps articulation can reinforce distorted premises; systems that only elaborate your starting assumption enable "distributed delusion" without testing the frame itself (2025-08 and later).
• Intent shifts across multi-turn conversation; systems misalign when they don't track structural trajectories of what users actually need (2026-02).

Anchor papers (verify; mind their dates):
• arXiv:2305.07961 (2023-05): Conversational recommendation and intent elicitation.
• arXiv:2508.19227 (2025-08): Generative interfaces for language models.
• arXiv:2508.19588 (2025-08): Hallucination and distributed delusion in AI-mediated reasoning.
• arXiv:2602.07338 (2026-02): Intent mismatch in multi-turn dialogue.

Your task:
(1) RE-TEST EACH CONSTRAINT. Has the >70% preference for generative UIs held under newer evaluation regimes (e.g., multi-turn, adversarial, real-world friction)? Have recent models or UI patterns (e.g., in-context scaffolding, structured generation, agentic systems) loosened the requirement that users need *reactive* examples to discover intent—or sharpened it? Does probing now happen implicitly in better prompt chains, or does it still demand explicit interface design?
(2) Surface the strongest *disagreeing* or *superseding* work from the last ~6 months. What papers argue intent *is* articulable upfront under the right framing, or that generative systems *don't* reduce the articulation burden, or that the "delusion" risk has been overblown or mitigated?
(3) Propose 2 research questions that assume the regime may have moved: (a) Can agentic orchestration (multi-step reasoning, tool use, memory) help users articulate intent *without* interactive UI—by letting the system probe on its own? (b) Do recent alignment or evals methods now detect and correct intent mismatch *before* co-constructing false beliefs?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Next inquiring lines