Can AI anticipate whether expert claims will be socially valid?
Expert knowledge involves more than correctness—it requires predicting whether fellow experts will accept a claim as valid. Can AI systems make this social judgment, or are they limited to statistical accuracy?
Expert claims are not just statements of fact. They are validity claims — assertions that carry an implicit "and here is why you should accept this." The implicit dimension is critical: the expert, in making a claim, is simultaneously performing a social calculation about whether this claim will be received as valid by the audience that matters.
This is not the same as being correct. A factually accurate claim can be socially invalid — wrong audience, wrong framing, wrong level of abstraction, wrong moment. And a simplified or imprecise claim can be socially valid — it captures what the audience needs to hear, in a form they can receive. The expert navigates this gap constantly, and the navigation is part of what makes them expert.
The circularity is structural, not incidental. Claims are valid because they are acceptable to the community of experts, and acceptable because they are valid by the community's standards. This is not a logical defect — it is how knowledge works in practice. Expert communities develop shared standards of what counts as a good argument, what evidence is sufficient, what framings are productive. New claims are evaluated against these standards, and the standards evolve through the accumulation of claims. The expert who makes a validity claim is invoking this entire apparatus — and the audience who evaluates it is operating within the same apparatus.
AI cannot perform this operation. When an LLM generates a response to a domain-specific question, it can estimate the probability that its output matches the distribution of "correct" answers in its training data. But this is a different calculation than anticipating whether a claim will be valid in the social sense. Since Should AI alignment target preferences or social role norms?, the normative-standards approach to alignment acknowledges this gap: the system should behave according to role-appropriate norms, not just preference-maximized outputs. But even role-alignment does not replicate the expert's anticipation of audience response, because role-alignment is a general policy, not a contextual judgment about a specific audience in a specific moment.
The practical stakes are highest in soft, interpretive domains. In formal domains (mathematics, logic, parts of engineering), the validity criteria are relatively explicit and standardized. An AI can check a proof against known rules. But in domains where expertise is more hermeneutic — law, medicine, strategic consulting, policy — the validity criteria are deeply contextual. What counts as a compelling argument in one jurisdiction, one clinical context, or one political climate may not count in another. The expert knows this because they are embedded in the context. The AI does not know this because it is embedded in a training distribution.
This connects to the problem of presupposition. Since Can LLMs identify the hidden assumptions that make arguments work?, LLMs can reproduce the surface structure of an argument without having access to the implicit warrants that make the argument valid for a specific audience. The validity claim is the warrant — the implicit "and this is why you should accept this" — and the warrant is audience-specific, context-dependent, and almost never stated in the text that the LLM was trained on.
The consequence for AI-generated expertise is that it can produce claims that look valid — that have the structural markers of expert claims — without being valid in the social sense. The output may be factually accurate, well-structured, and confidently stated, but it may fail the validity test when presented to the expert community because it doesn't account for what that community currently considers important, contested, or settled.
Inquiring lines that use this note as a source 27
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can social validation of expertise exclude systems that lack participatory track records?
- What role shifts occur when experts become custodians of AI knowledge?
- How does AI presentation authority substitute for actual expert judgment?
- What does it mean that AI knowledge is structurally hearsay?
- Can markets price knowledge claims if there is no shared agreement on what backing means?
- What happens to expert credibility when AI-generated claims drown out specialist signals?
- Does surface authority without earned authority create risks in expert judgment?
- Does stripping social context from knowledge claims hollow out their meaning?
- Can AI systems produce genuinely new validity claims without community participation?
- What role did human experts play in raising social alarms historically?
- What happens to professional expertise when judgment gets encoded into systems?
- What makes expert judgment depend on anticipating audience acceptability?
- Can diverse expert demonstrations exceed the knowledge of any single expert?
- How do experts decide which information matters for a specific audience?
- What makes a claim socially valid even if factually imprecise?
- How do experts select which other experts to trust?
- Why do two experts with identical knowledge produce different outcomes in the same situation?
- What expertise survives in a world where AI can generate knowledge on demand?
- What happens when experts prompt using their own technical register?
- Why do medical diagnoses require human judgment even with AI assistance?
- Can artificial systems develop the authority to challenge expert claims?
- How does epistemic stagflation change what expertise actually means?
- Can expert validation scale fast enough to back AI token production?
- What role could knowledge custodians play in validating AI output?
- How do expert communities develop and enforce standards for valid arguments?
- What implicit warrants do expert arguments rely on that AI cannot reliably access?
- Why can't AI truly understand expertise without joining the validating community?
Related concepts in this collection 5
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Should AI alignment target preferences or social role norms?
Current AI alignment approaches optimize for individual or aggregate human preferences. But do preferences actually capture what matters morally, or should alignment instead target the normative standards appropriate to an AI system's specific social role?
role-alignment addresses validity gap but does not replicate contextual judgment
-
Can LLMs identify the hidden assumptions that make arguments work?
LLMs recognize what arguments claim and what evidence they offer, but struggle to identify implicit warrants—the unstated principles that connect evidence to conclusion. This matters because valid reasoning requires understanding these hidden logical bridges.
validity claims require implicit warrants that LLMs cannot access
-
Can models learn argument quality from labeled examples alone?
Explores whether fine-tuning on quality-labeled examples teaches models the underlying criteria for evaluating arguments, or merely surface patterns. Matters because high-stakes assessment tasks depend on reliable, transferable quality judgment.
quality criteria for validity claims cannot be learned from pattern alone
-
Can formal argumentation make AI decisions truly contestable?
Explores whether structuring AI decisions as formal argument graphs (with explicit attacks and defenses) enables users to meaningfully challenge and navigate reasoning in ways unstructured LLM outputs cannot.
formal frameworks attempt to make validity criteria explicit, but context resists full formalization
-
Does any single persuasion technique work for everyone?
Can fixed persuasion strategies like appeals to authority or social proof be reliably applied across different people and situations, or do they require adaptation to individual traits and context?
validity is always contextual, not universal
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- AI Models Exceed Individual Human Accuracy in Predicting Everyday Social Norms
- Beyond Hallucinations: The Illusion of Understanding in Large Language Models
- A sociotechnical perspective for the future of AI: narratives, inequalities, and human control
- Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
- We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy
- GenAI as a Power Persuader: How Professionals Get Persuasion Bombed When They Attempt to Validate LLMs
- Thousands of AI Authors on the Future of AI
- AI for Auto-Research: Roadmap & User Guide
Original note title
validity claims always anticipate audience response — expertise is knowing what will be acceptable to fellow experts not just what is correct