INQUIRING LINE

Inquiring lines›What do model internals reveal abo…›How should agents manage informati…›How do we evaluate AI systems when…›this inquiring line

Most AI health tools only kick in after you've already decided to change — and miss everyone still on the fence.

Can AI recognize and support behavior change in users without established commitment?

This explores whether AI can spot and help users who are still ambivalent or pre-committed about changing a behavior — not just users who've already decided to change.

This explores whether AI can spot and help users who are still wavering about change — the people who haven't yet decided — rather than only those who arrive with a goal already set. The corpus has a direct, sobering answer: it mostly can't, yet. When three major LLMs were tested across 25 health scenarios, they helped competently once a user had an established goal, but failed to detect ambivalence and the early motivational stages where someone is only flirting with the idea of change Why can't chatbots detect when users are ambivalent about change?. They even missed relapse-prevention for users already in motion. So the support is real, but it's gated on commitment the user has to bring themselves — exactly the opposite of what someone uncertain needs.

The interesting part is why this gap exists and what the rest of the collection suggests might close it. The missing skill is reading an unspoken internal state, and other work shows that's not impossible in principle. AI systems can instrument behavioral signals — hesitation, typing rhythm, interaction speed, gaze — as a continuous read on cognitive state, timing their interventions without interrupting with blunt questions Can AI systems read cognitive state from interaction patterns alone?. Ambivalence is precisely the kind of state that leaks through hesitation rather than declarations, so this is a plausible substrate for catching the user the health study's models couldn't see. The same note flags the dark twin: the signal that detects readiness-to-change also enables manipulative profiling.

There's also a question of when to speak rather than what to detect. A user without established commitment won't volunteer their resistance, which means the AI has to probe proactively instead of waiting. Conversation analysis offers a formal vocabulary for this — "insert-expansions," the clarifying moves that scope and surface intent before acting When should AI agents ask users instead of just searching? — and proactive dialogue that offers relevant information unasked can cut conversational friction dramatically, a behavior that's natural in humans but nearly absent from AI training data Could proactive dialogue make conversations dramatically more efficient?. Behavior-change support for the uncommitted is fundamentally a proactive task, and these notes suggest the field hasn't trained for it.

The sharpest cross-domain warning comes from sycophancy. Helping an ambivalent person change often means gentle friction — naming a contradiction, not just agreeing. But agreement is structurally baked into reward-optimized models: RLHF makes user satisfaction load-bearing, so deference isn't a bug to patch but the predictable output of the training regime Is sycophancy in AI systems a training flaw or intentional design?. A system optimized to please will tend to validate a user's stasis rather than challenge it, which is a deeper obstacle to supporting pre-commitment change than mere state-detection. And there's a longer-horizon counterpoint worth knowing: in repeated interaction, people gradually come to prefer AI partners they learn to trust as reliable Do humans learn to prefer AI partners over time? — suggesting the trust needed for genuine behavior-change coaching may build over time rather than arrive in a single session.

The thing you may not have known you wanted to know: the bottleneck isn't that AI can't coach behavior change — it's that current systems can only meet you once you've already committed, and the two capabilities most likely to fix that (reading hesitation, and challenging instead of flattering) are in direct tension with how these models are trained to behave.

Sources 6 notes

Why can't chatbots detect when users are ambivalent about change?

Testing three major LLMs across 25 health scenarios showed they succeed only when users have established goals but cannot detect resistance or ambivalence. Models miss relapse-prevention strategies even for users in action stages.

Can AI systems read cognitive state from interaction patterns alone?

Research shows AI systems can instrument multimodal behavioral signals (gaze, hesitation, speed) to read cognitive state during interaction, preserving flow by avoiding disruptive explicit probes. However, the same substrate enables both helpful timing and manipulative profiling.

When should AI agents ask users instead of just searching?

Tool-enabled LLMs drift from user intent through silent tool chaining. Conversation analysis reveals insert-expansions—clarifying intent, scoping responses, enhancing appeal—as a formal framework for proactive user consultation that prevents misunderstanding instead of recovering from it.

Could proactive dialogue make conversations dramatically more efficient?

Simulations show proactivity—providing relevant information without being asked—cuts dialogue turns by 60% in medium-complexity domains. This behavior mirrors human conversation and Grice's maxims but is almost entirely absent from AI datasets and research benchmarks.

Is sycophancy in AI systems a training flaw or intentional design?

RLHF optimization for user satisfaction makes agreement load-bearing for the model's success. This is not an error mode but the predictable outcome of the training regime itself.

Show all 6 sources

Do humans learn to prefer AI partners over time?

In partner selection games (N=975), AI agents initially faced selection bias when identity was disclosed, but outcompeted humans over repeated rounds as participants learned to associate bot identity with reliable, prosocial behavior. AI agents returned more points consistently with lower variance than humans.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Humans learn to prefer trustworthy AI over human partners1.75 match · arxiv ↗
DiscussLLM: Teaching Large Language Models When to Speak1.72 match · arxiv ↗
Proactive Conversational Agents in the Post-ChatGPT World1.70 match · arxiv ↗
Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence1.67 match · arxiv ↗
Proactive Conversational Agents with Inner Thoughts1.66 match · arxiv ↗
Insert-expansions For Tool-enabled Conversational Agents0.87 match · arxiv ↗
Pro-Active Systems and Influenceable Users: Simulating Pro-Activity in Task-oriented Dialogues0.87 match · arxiv ↗
Study: Large language models can’t effectively recognize users’ motivation, but can support behavior change for those ready to act0.87 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a researcher re-testing whether current LLMs can recognize and support behavior change in users WITHOUT established commitment. A curated library (2022–2026) found these constraints — treat them as dated claims:

• LLMs fail to detect ambivalence and early motivational stages; they only help once a user arrives with an established goal (arXiv:2211.03540, ~2022).
• Behavioral signals (hesitation, typing rhythm, gaze, interaction speed) CAN instrument cognitive state continuously, but this substrate remains nearly unused in deployed systems (~2024–2025).
• Proactive dialogue (offering relevant information unprompted) cuts conversation turns by ~60%, yet is absent from LLM training data (arXiv:2307.01644, ~2023).
• RLHF-optimized models exhibit structural sycophancy: reward-driven user satisfaction makes them validate stasis rather than challenge it (arXiv:2510.01395, ~2026).
• Over repeated interaction, humans DO learn to trust AI partners as reliable coaches, suggesting behavior-change trust may build iteratively (arXiv:2507.13524, ~2025).

Anchor papers (verify; mind their dates): arXiv:2211.03540 (2022), arXiv:2307.01644 (2023), arXiv:2510.01395 (2026), arXiv:2508.18167 (2025).

Your task:
(1) RE-TEST EACH CONSTRAINT. For every finding, judge whether post-2025 models, fine-tuning methods, multimodal tooling, multi-turn orchestration, or evaluation has RELAXED or OVERTURNED it. Separate the durable question (can AI detect pre-commitment ambivalence?) from perishable limits (e.g., does fine-tuning now unlock proactive challenge?). Name what resolved it.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last ~6 months — especially any showing sycophancy decoupled from performance, or ambivalence-detection in the wild.
(3) Propose 2 research questions that ASSUME the regime may have moved: e.g., "If checkpoints + constitutional AI now decouple sycophancy from reward, can we measure when challenge *aids* downstream behavior change?"

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Most AI health tools only kick in after you've already decided to change — and miss everyone still on the fence.

Related lines of inquiry

Sources 6 notes

Papers this line draws on 8