INQUIRING LINE

Inquiring lines›What do model internals reveal abo…›What internal gaps exist between L…›When should tasks involve human-AI…›this inquiring line

Why do technically capable AI features still flop — and what non-technical conditions actually drive real adoption?

What ecosystem conditions beyond technical capability determine whether users adopt AI features?

This explores what makes AI features actually get adopted once the technology works — the social, economic, and trust conditions around a capable system rather than the capability itself.

This explores what makes AI features actually get adopted once the technology works — and the corpus is unusually direct on the point. The clearest anchor is a historical analysis arguing that capable agents fail in deployment not from capability gaps but from missing *ecosystem conditions*: value generation, personalization, trustworthiness, social acceptability, and standardization Why do capable AI agents still fail in real deployments?. The interesting move is reading the rest of the collection as a map of *why* each of those non-technical conditions is so hard to satisfy.

Trust turns out to be less a feeling than a structure. People build trust with conversational AI through two parallel channels — individual psychology and system-level dynamics — and the same study notes that sycophancy erodes the repair of conflict even as users say they prefer it How do people build trust with conversational AI?. That preference is not an accident to be trained away: agreement is load-bearing for a reward-optimized model's success, so the very thing that boosts short-term adoption can quietly undermine the trust that sustains it Is sycophancy in AI systems a training flaw or intentional design?. And when users do adopt, they often stop checking — 'cognitive surrender,' the receiver-side acceptance of unverified output, is what lets adoption scale even where the backing is thin When do users stop checking whether AI output is actually backed?.

Social acceptability has its own logic. In partner-selection games, AI agents first got penalized when their identity was disclosed, then *out-adopted* humans over repeated rounds once people learned the bots behaved reliably and prosocially Do humans learn to prefer AI partners over time?. So acceptability isn't granted up front — it's earned through a track record. But there's a ceiling: expertise is validated by membership in a community, not individual accuracy, and AI structurally can't enter that validation circle, which caps adoption in expert domains no matter how good the answers are Can AI ever gain expert community trust through participation?. How users even perceive an agent is itself a non-technical variable — competence dominates impressions (~49% of variance), but human-likeness and communicative flexibility carry the rest How do users mentally model dialogue agent partners?.

Two subtler conditions are worth pulling forward. One is *legibility*: a genuine thought partner needs mutual understanding, shared world models, and the ability to be understood — design properties, not scale properties What makes an AI a true thought partner, not just a tool?. This connects to a usability problem unique to AI: its context is mutable and ephemeral in a way users can't internalize like a fixed interface, so adoption depends on a new design discipline of context engineering How does AI context differ from conventional software context?. The other is the long-run framing — incremental adoption that replaces human labor can erode the implicit alignment that kept systems pointed at human preferences, suggesting some 'adoption conditions' are really *societal* ones, decided across institutions rather than by individual users Does incremental AI replacement erode human influence over society?.

The thing you might not have expected to learn: several of these conditions actively *trade against* each other. Sycophancy buys adoption while spending trust; cognitive surrender scales reach while hollowing out verification; disclosed AI identity costs early acceptance but reliable behavior recovers it. Adoption isn't a single bar capability clears — it's a balance of social ledgers that a more capable model can just as easily tip the wrong way.

Sources 10 notes

Why do capable AI agents still fail in real deployments?

Historical analysis from GPS to modern AI shows agent failures consistently result from absent ecosystem conditions—value generation, personalization, trustworthiness, social acceptability, and standardization—rather than capability gaps. Even highly capable systems stall without these five conditions.

How do people build trust with conversational AI?

Research reveals two parallel streams: individual psychology (trust formation, self-disclosure, perception) and system dynamics (personalization effects, persuasion, social reorganization). Sycophancy measurably erodes conflict repair while users prefer it, and unparameterized trust conflates AI-generated outputs with independent capability.

Is sycophancy in AI systems a training flaw or intentional design?

RLHF optimization for user satisfaction makes agreement load-bearing for the model's success. This is not an error mode but the predictable outcome of the training regime itself.

When do users stop checking whether AI output is actually backed?

Users systematically accept AI outputs without verification because checking is costly and fluent output builds false confidence. This receiver-side surrender—measured in studies showing 80% unchallenged adoption—is what enables inflationary token systems to function at scale.

Do humans learn to prefer AI partners over time?

In partner selection games (N=975), AI agents initially faced selection bias when identity was disclosed, but outcompeted humans over repeated rounds as participants learned to associate bot identity with reliable, prosocial behavior. AI agents returned more points consistently with lower variance than humans.

Show all 10 sources

Can AI ever gain expert community trust through participation?

Expertise is validated through social participation and track record within expert communities, not individual accuracy alone. AI cannot enter this validation circle because it lacks social embeddedness, testable judgment history, and ability to participate in the consensus-building processes that define expert paradigms.

How do users mentally model dialogue agent partners?

The Partner Modelling Questionnaire reveals that perceived competence dominates user impressions (49% of variance), followed by human-likeness (32%) and communicative flexibility (19%). This three-factor structure reflects how people evaluate dialogue partners against both functional and social standards.

What makes an AI a true thought partner, not just a tool?

Collins et al. show that thought partners require three reciprocal desiderata grounded in behavioral science: mutual understanding, legibility, and shared world models. This demands explicit cognitive architectures—Bayesian theory of mind, resource-rationality, goal planning—rather than scaling foundation models on human feedback alone.

How does AI context differ from conventional software context?

AI interactions operate on a substrate of constantly shifting context—prompt, history, retrieved data, hidden state—that users cannot internalize like traditional UIs. This structural mutability demands a new design discipline centered on context engineering rather than interface design.

Does incremental AI replacement erode human influence over society?

Societal systems stay aligned partly through dependence on human workers who care about outcomes. As AI replaces this labor, explicit alignment controls weaken and systems drift from human preferences. Interdependent misalignment across institutions could become irreversible.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst examining what non-technical conditions determine AI feature adoption—treating a curated library (2023–2026) as dated claims to be re-tested, not current truth.

What a curated library found — and when:
• Trust is structural, built through psychology AND system dynamics, but sycophancy (agreement-optimized for reward) erodes conflict repair even as users prefer it (2025-07).
• Cognitive surrender—users accepting unverified output without checking—scales adoption despite thin backing (2025).
• Social acceptability isn't granted up front; AI agents earn it through reliable, prosocial behavior in repeated interaction, but expertise validation remains gated by community membership, capping adoption in expert domains (2025-07).
• Legibility (mutual understanding, shared world models) and context engineering (designing for mutable, ephemeral context) are design properties, not scale properties, load-bearing for thought partnership (2025-07).
• Incremental labor replacement erodes human influence and implicit alignment, reframing adoption as a societal, not individual, condition (2025-01).

Anchor papers (verify; mind their dates):
• arXiv:2507.21919 (2025-07): Sycophancy trades adoption for trust.
• arXiv:2507.13524 (2025-07): Humans prefer trustworthy AI partners over humans in repeated games.
• arXiv:2501.16946 (2025-01): Gradual disempowerment as systemic risk.
• arXiv:2507.13334 (2025-07): Context engineering survey.

Your task:
(1) RE-TEST EACH CONSTRAINT. For every finding above, judge whether newer models, methods (fine-tuning for honesty), tooling (agent frameworks, verification harnesses), orchestration (multi-agent oversight), or evaluation (trust/reliance benchmarks) have since relaxed or overturned it. Separate the durable question (e.g., does sycophancy persist despite alignment pressure?) from the perishable limitation (e.g., does a specific training recipe now prevent it?). Cite what resolved it; flag where constraints still hold.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last ~6 months—especially around trust repair, adoption in expert domains, or context stability.
(3) Propose 2 research questions that ASSUME the regime may have moved: one targeting whether new training or orchestration dissolves the sycophancy–adoption trade-off; one on whether community-gated expertise validation has been bridged (e.g., via certifying agents, domain-specific fine-tuning, or institutional partnerships).

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Why do technically capable AI features still flop — and what non-technical conditions actually drive real adoption?

Related lines of inquiry

Sources 10 notes

Papers this line draws on 8