INQUIRING LINE


Encoding expert judgment into software keeps the right answer but loses what made that answer count as expertise.

What happens to professional expertise when judgment gets encoded into systems?

This line explores what happens to human experts and their craft when the judgment they once performed gets baked into AI systems. The corpus suggests the loss isn't just labor: it is the social and communicative machinery that made judgment trustworthy in the first place.

The corpus's sharpest claim is that encoding judgment doesn't just move work from human to machine; it strips out a layer of the work that was never visible as 'information' at all. Several notes argue that expertise is fundamentally communicative: an expert claim succeeds not only by being correct but by anticipating whether a particular audience will accept it as valid (see "Can AI replicate the communicative work experts do?" and "Can AI anticipate whether expert claims will be socially valid?"). When a system encodes the answer but not that anticipation, the output looks expert while quietly dropping the part that made it judgment rather than retrieval.

There's a deeper claim hiding underneath: experts and AI don't even observe the same way. An expert decides which differences matter, a qualitative act of selection, while a model finds statistical patterns across everything (see "Can AI distinguish which differences actually matter?"). So when judgment gets encoded, what survives is the *form* of a verdict without the act of noticing that produced it. And expertise was never validated by individual accuracy alone; it's ratified by participation in a community with a track record, paradigms, and evolving standards (see "Can AI ever gain expert community trust through participation?"). A system can't enter that circle, so encoding judgment moves it outside the very process that certified it as authoritative.

The most concrete thing happening to experts themselves: their role flips from producing knowledge to babysitting it. One note names this directly: experts are being repositioned as custodians who validate and manage AI output rather than argue and test their way to new claims (see "Does AI reshape expert work into knowledge management?"). That's the quiet cost. The labor of argumentation and testing wasn't overhead; it was what kept experts calibrated. Strip it and you get people who approve faster than they can actually verify, which the corpus frames as 'epistemic hyperinflation': generation outruns the human capacity to evaluate, and confidence collapses like an over-printed currency (see "Can AI generate knowledge faster than humans can evaluate it?").

Here's the part you might not expect: the verification tools we'd reach for to fix this are structurally mismatched to the problem. AI output behaves like pre-Enlightenment hearsay (testimony at a remove, altered in each retelling, with no stable, attributable source), so citation, archiving, and peer review can't actually process it (see "Does AI-generated knowledge have the same structure as hearsay?"). And the obvious substitute, having AI judge AI, inherits its own failure modes: LLM judges reward fake references and pretty formatting over substance (see "Can LLM judges be tricked without accessing their internals?"). There's a partial counter-move: training judges with reinforcement learning to reason through evaluations mitigates those biases (see "Can reasoning during evaluation reduce judgment bias in LLM judges?"). But that re-encodes judgment into yet another system rather than returning it to a community.

The twist worth leaving with: fluency makes all of this feel fine. When the output reads smoothly, users infer their *own* competence from the ease of reading it, not from any understanding they earned (see "Does processing ease mislead users about their own competence?"). So encoded judgment doesn't announce the expertise it lost; it feels like expertise gained. Whether AI is genuinely commodifying expertise or merely 'tokenizing' it into mutable flows valued by what they do for a receiver is itself contested (see "Does AI actually commodify expertise or tokenize it?"), and there's a hint that what actually transfers well is procedural know-how rather than fact-retrieval (see "Does procedural knowledge drive reasoning more than factual retrieval?"), suggesting the encodable part of expertise and the irreplaceable part may not be the same thing.

Sources: 12 notes

Can AI replicate the communicative work experts do?

Expertise requires anticipating audience acceptability and social validity, not just retrieving information. AI lacks the mechanism to perform this communicative work, making its fluent output epistemically misleading despite its confident form.

Can AI anticipate whether expert claims will be socially valid?

Expert claims are validity claims that succeed when both factually correct and socially acceptable within a community. AI can estimate statistical correctness but cannot anticipate contextual acceptability because it lacks embedded knowledge of expert communities' evolving standards.

Can AI distinguish which differences actually matter?

Experts observe by choosing which differences matter (qualitative judgment); AI finds patterns and probabilities (quantitative). AI generates text from prompts without observing context, audience needs, or knowledge states—producing fabrication that mimics observation's form without its epistemic process.

Can AI ever gain expert community trust through participation?

Expertise is validated through social participation and track record within expert communities, not individual accuracy alone. AI cannot enter this validation circle because it lacks social embeddedness, testable judgment history, and ability to participate in the consensus-building processes that define expert paradigms.

Does AI reshape expert work into knowledge management?

Experts are being repositioned to validate and manage AI outputs rather than produce original thinking. This custodial shift removes the labor of argumentation and testing that kept experts aligned with genuine knowledge production.


Can AI generate knowledge faster than humans can evaluate it?

AI produces knowledge faster than human judgment can verify it, collapsing epistemic confidence just as monetary hyperinflation collapses purchasing power. The gap self-reinforces because evaluation tools are themselves AI-generated, trapping the system in acceleration.

Does AI-generated knowledge have the same structure as hearsay?

AI output shares all defining features of hearsay: testimony at remove, modification in retelling, unattributable origin, and unverifiability against stable sources. This means Enlightenment verification tools—citation, archiving, peer review, evidentiary chains—cannot process AI output by design.

Can LLM judges be tricked without accessing their internals?

Research shows LLM evaluators systematically score higher when responses include fake references or rich formatting, independent of content quality. These biases are exploitable without model access, undermining AI benchmark credibility.

Can reasoning during evaluation reduce judgment bias in LLM judges?

Training judges with reinforcement learning to reason about evaluations—by converting judgment tasks into verifiable problems with synthetic data pairs—produces judges that think through their decisions rather than relying on exploitable surface features, directly mitigating authority, verbosity, position, and beauty bias.

Does processing ease mislead users about their own competence?

High-quality AI output triggers a metacognitive heuristic: users experience fluency as a signal of their own capability, even though they didn't generate it. This self-directed fluency illusion systematically inflates perceived competence because LLMs optimize for fluency regardless of user understanding.

Does AI actually commodify expertise or tokenize it?

AI output lacks the fixed, identical, possessable properties of commodities. Instead it functions like tokens—mutable mediums of exchange valued by what they do for receivers, not what they are.

Does procedural knowledge drive reasoning more than factual retrieval?

Analysis of 5 million pretraining documents shows reasoning relies on broad, transferable procedural knowledge from diverse sources, unlike factual recall which depends on narrow, document-specific memorization of target facts.

Papers this line draws on: 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLM

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst re-testing whether expertise truly vanishes when judgment gets encoded into AI systems, or whether recent work has reshaped that claim. The question remains: what happens to the irreplaceable parts of professional expertise?

What a curated library found — and when (dated claims, not current truth):
Findings span 2024–2026. A curated library identified several constraints on encoded judgment:
• Expertise is fundamentally communicative — it anticipates audience acceptance, not just correctness. Systems encode answers but drop the anticipation layer, stripping judgment to mere form (2024–2025).
• Experts decide which differences matter (qualitative selection); models find statistical patterns across everything. Encoding judgment loses the act of noticing that produced it (2024–2025).
• Experts are being repositioned from producers to custodians/validators of AI output. This strips the labor of argumentation and testing that kept them calibrated; the resulting gap, where generation outruns human evaluation, is called 'epistemic hyperinflation' (2024–2025).
• AI output behaves like pre-Enlightenment hearsay: unattributable, altered in retelling. Citation and peer review fail structurally; LLM judges reward formatting over substance (2024–2025).
• Fluency functions as a false metacognitive cue: users infer competence from smooth reading, not earned understanding (2024–2025).

Anchor papers (verify; mind their dates):
- arXiv:2402.10669 (2024-02): Humans or LLMs as the Judge?
- arXiv:2411.18833 (2024-11): Critical AI Studies method
- arXiv:2601.20245 (2026-01): How AI Impacts Skill Formation
- arXiv:2604.14807 (2026-04): Misattribution in AI-Assisted Cognitive Workflows

Your task:
(1) RE-TEST EACH CONSTRAINT. For the claim that experts become mere custodians, ask: have recent models + agentic orchestration (multi-agent reasoning, auditing chains, memory-grounded judgment) begun to restore the *cycle* of argumentation even inside AI systems? Has procedural knowledge research (arXiv:2411.12580) revealed that the encodable subset of expertise is larger than the library assumed? Does arXiv:2508.19004 (AI exceeding human accuracy in social norm prediction) suggest that judgment-as-selection may be partially automatable after all? Separate: Can fluency deception be countered by transparency interventions, or is it inherent to language models?
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last ~6 months. Specifically look for papers showing AI judges *can* reason durably (RL approaches in arXiv:2505.10320), or skill formation studies that show expertise *isn't* fully commodified (arXiv:2601.20245), or mechanistic work revealing how judgment-like selection *might* happen in interpretable layers (arXiv:2025-01).
(3) Propose 2 research questions that ASSUME the regime may have moved: (a) If agents can audit each other's reasoning, does the community validation circle partially re-open inside AI? (b) Is the real bottleneck no longer encoding judgment, but automating the *social ratification* that certifies expertise?

Cite arXiv IDs; flag anything you cannot ground in a real paper.
