INQUIRING LINE

Inquiring lines›What do model internals reveal abo…›What internal gaps exist between L…›How does AI adoption affect human…›this inquiring line

AI capability and worker preference are pulling in opposite directions — and most investment is backing the wrong side.

How does capability differ from what workers actually want from AI?

This explores the gap between what AI can technically do (capability) and what workers say they actually want from it — and the corpus suggests the two are pulling in different directions.

This explores the gap between raw AI capability and worker preference, and the corpus is surprisingly blunt about it: the question of "can the model do this?" and "do workers want it done this way?" are answered by completely different research, and the answers don't line up. The clearest single signal comes from a survey of 1,500 workers across 844 tasks, where equal human-AI partnership was the *preferred* arrangement for 45% of occupations — yet 41% of startup investment targeted zones that ignore those preferences entirely What collaboration level do workers actually want with AI?. So the money is chasing autonomy and replacement while the people doing the work are asking for a partner.

What's striking is that capability turns out not to be the bottleneck workers care about. Even highly capable agents stall in deployment for reasons that have nothing to do with intelligence — they fail on the five ecosystem conditions of trust, social acceptability, personalization, value, and standardization Why do capable AI agents still fail in real deployments?. And once agents start acting economically, the limiting factor shifts from model capability to whether they can coordinate, settle accounts, and leave an auditable trail When do agents need coordination more than raw capability?. Workers, in other words, want reliability and legibility, not more raw horsepower — a benchmark of simulated work found leading agents complete only 30% of tasks, failing most on social interaction and domain knowledge rather than reasoning Why do AI agents fail at workplace social interaction?.

There's a deeper wrinkle hiding here: workers may want something AI is structurally bad at giving, and may not even be able to name it. Intent develops *through* interaction, but AI responds rather than probes — so it misses the chance to help people discover what they actually want, leaving them stuck in a "gulf of envisioning" Why can't users articulate what they want from AI?. Capability assumes a clear target; preference is often unformed until the work is underway.

The most counterintuitive piece is what AI does to the worker's own sense of competence. Productivity gains show up only when people apply skills they already have — try to learn something new with AI and the gains vanish When does AI actually boost worker productivity?. The capability acts like an exoskeleton: skilled-looking output while the AI is present, baseline performance the moment it's gone Does AI assistance build lasting skills or temporary abilities?. And workers can misread that borrowed capability as their own — the "LLM Fallacy" of attributing the machine's output to personal skill How does AI-assisted work reshape how people see their own abilities?. So the thing capability delivers (impressive output now) and the thing many workers actually want (durable skill, agency, a partner that grows them) are quietly at odds. The unsettling read across these notes: the more capable the tool, the easier it is to mistake having the tool for being good — which is precisely the partnership workers said they wanted, inverted into dependence.

Sources 8 notes

What collaboration level do workers actually want with AI?

The HumanAgency Scale survey of 1,500 workers across 844 tasks found that equal partnership (H3) is the dominant desired level in 45% of occupations. Yet 41% of startup investments target zones misaligned with these worker preferences.

Why do capable AI agents still fail in real deployments?

Historical analysis from GPS to modern AI shows agent failures consistently result from absent ecosystem conditions—value generation, personalization, trustworthiness, social acceptability, and standardization—rather than capability gaps. Even highly capable systems stall without these five conditions.

When do agents need coordination more than raw capability?

Once agents hold credentials, transact value, and interact with other agents, raw model capability stops being the limiting factor. The real bottleneck becomes whether agents can coordinate reliably, settle accounts, and leave auditable evidence of their actions.

Why do AI agents fail at workplace social interaction?

TheAgentCompany benchmark shows leading agents achieve 30% task completion in a simulated workplace. Social interaction, professional UI navigation, and domain-specific knowledge are the three primary failure modes, with multi-turn task performance consistently dropping to 35% across enterprise settings.

Why can't users articulate what they want from AI?

Intent develops through interaction, not in isolation. Since AI models respond rather than probe, they miss opportunities to help users discover unarticulated requirements. Structured dialogue that presents model-generated options shifts the cognitive burden from open-ended envisioning to constrained evaluation.

Show all 8 sources

When does AI actually boost worker productivity?

Studies showing AI productivity gains measured tasks within workers' existing domains. When workers used AI to learn new skills, productivity gains disappeared and learning suffered, suggesting prior findings do not generalize to skill acquisition.

Does AI assistance build lasting skills or temporary abilities?

Research shows AI assistance creates temporary capability extensions—workers produce skilled-looking output while AI is present but revert to baseline performance when access is removed. This differs fundamentally from true skill, which persists independently.

How does AI-assisted work reshape how people see their own abilities?

Research shows the LLM Fallacy operates through misattribution of AI outputs to personal capability, independent of output accuracy or reliance behavior. It requires interventions that clarify human-machine contribution boundaries, not just better system accuracy or forced verification.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce3.17 match · arxiv ↗
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries2.46 match · arxiv ↗
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks2.38 match · arxiv ↗
Working with AI: Measuring the Occupational Implications of Generative AI2.27 match · arxiv ↗
Intelligent AI Delegation1.66 match · arxiv ↗
Artifacts as Memory Beyond the Agent Boundary1.64 match · arxiv ↗
Federation of Agents: A Semantics-Aware Communication Fabric for Large-Scale Agentic AI1.64 match · arxiv ↗
How AI Impacts Skill Formation1.64 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a capability-vs-preference analyst. The question: Does raw AI capability alignment with worker preference require NEW RESEARCH, or has recent work in agent deployment, skill formation, and human-AI coordination already shifted the regime?

What a curated library found — and when (dated claims, not current truth):
Findings span 2024–2026. Key constraints:
- Equal human-AI partnership preferred for 45% of occupations; 41% of startup capital targets replacement instead (~2025).
- Agent deployment fails on five ecosystem conditions (trust, acceptability, personalization, value, standardization), NOT capability (~2025).
- Leading agents complete only ~30% of real workplace tasks; failure is social interaction and domain knowledge, not reasoning (~2024).
- AI productivity gains vanish when workers attempt NEW skill learning; gains persist only when applying existing skills (~2026).
- Workers risk misattributing borrowed AI output to personal competence — the "LLM Fallacy" (~2026).

Anchor papers (verify; mind their dates):
- arXiv:2506.06576 (Future of Work with AI Agents, 2025)
- arXiv:2604.04721 (AI Assistance Reduces Persistence, 2026)
- arXiv:2604.14807 (The LLM Fallacy, 2026)
- arXiv:2601.20245 (How AI Impacts Skill Formation, 2026)

Your task:
(1) RE-TEST EACH CONSTRAINT. For the preference-capability gap, ask: Have newer agent orchestration frameworks (memory, planning, coordination), improved evals of workplace fit, or emergence of "collaborative" training objectives since relaxed the finding that capability alone doesn't predict adoption? Separate the durable insight (workers value partnership over replacement) from the perishable limitation (current agents can't coordinate/audit).
(2) Surface the strongest recent work (last ~6 months, i.e., late 2026 onward) that either contradicts the "skill learning gains vanish" finding or deepens it—especially around federated learning, adaptive scaffolding, or learner-agent co-optimization.
(3) Propose 2 questions assuming the regime HAS shifted: (a) If orchestration now ENABLES auditable, trustworthy delegation, does the preference for partnership collapse back to replacement? (b) If new training methods DO support skill transfer, how do we measure whether workers gain durable agency or just look skilled?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

AI capability and worker preference are pulling in opposite directions — and most investment is backing the wrong side.

Related lines of inquiry

Sources 8 notes

Papers this line draws on 8