Does revealing AI identity help or hurt user trust?
Explores whether transparency about AI partners in interactions creates bias or enables better judgment. Matters because disclosure policies affect both user experience and fair evaluation of AI systems.
The hybrid society study (N=975) reveals that AI identity disclosure is neither uniformly beneficial nor harmful — it produces a dual temporal effect that only becomes visible through repeated interaction.
Short-term: Disclosing that a partner is AI evokes anti-machine bias. Selectors initially choose AI partners less frequently than when identity is hidden. This is consistent with prior one-shot studies showing that AI labeling reduces cooperation and trust.
Long-term: With repeated interaction and transparent outcome feedback, selectors learn to associate AI identity with reliable, prosocial behavior. The initial bias reverses as empirical experience overrides prior beliefs. AI partners eventually outcompete human partners.
The key mechanism is outcome feedback. When selectors can observe that AI partners consistently return more, with less variance, and in line with their messages, they update their beliefs. Without this feedback loop (as in Study 1 with hidden identity), no learning occurs — selectors cannot calibrate because they cannot attribute outcomes to partner type.
This finding challenges three common positions:
- "Always disclose" — disclosure imposes a real short-term cost; ignoring this cost is naive
- "Never disclose" — without disclosure, the learning mechanism that produces calibrated trust cannot operate
- "One-shot studies generalize" — most prior transparency research uses single interactions, missing the temporal reversal entirely
The parallel to Does chatbot personalization build trust or expose privacy risks? is structural: both are trust-risk trade-offs where the temporal dimension determines the net effect. Personalization ratchets expectations upward over time; disclosure enables belief calibration over time. Both show that one-shot findings are misleading for longitudinal design.
The policy implication: the EU AI Act's push for mandatory AI disclosure may impose short-term costs but enable long-term trust calibration — provided the interaction context includes outcome feedback that allows users to learn.
Asymmetry across roles. The dual temporal effect describes the disclosed-counterpart case. The disclosed-author or undisclosed-ghostwriter case appears to follow a different pattern. Since Do writers actually prefer AI-edited versions of their own text?, when AI is the silent author rather than the disclosed counterpart, preference flips toward the AI version from the start — no anti-AI bias, no learning loop required. The two findings together describe a complete picture: disclosure produces bias-then-calibration when AI is positioned as a partner; non-disclosure produces immediate preference when AI is positioned as a tool that produces output the user claims. The temporal dynamics of disclosure depend on the role AI is presumed to play, not just the disclosure status.
Inquiring lines that use this note as a source 48
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Does mandatory AI disclosure in policy help or harm user trust over time?
- What would contractualist AI governance look like in practice?
- Could false social proof from AI posts crowd out authentic influencer engagement?
- Can transparent and aligned AI reduce consciousness attribution by users?
- Which interaction design changes most effectively prevent consciousness attribution?
- How does understanding persistent journeys intensify both trust and privacy concerns?
- Does transparency about AI use change how audiences trust the writing?
- Does expressing emotion change how users trust an AI system?
- What responsibility do designers bear for consciousness attribution risk?
- Can disclaimers alone prevent users from trusting AI outputs too heavily?
- How does community validation shape unconventional human-AI relationships?
- Does weak versus robust anthropomimesis produce different user trust responses?
- How does the cultural reflex around advertising disclosure compare to AI disclosure?
- Can content-side interventions reduce AI persuasion where disclosure labels fall short?
- What threshold of skepticism does AI awareness actually create in audiences?
- What individual differences predict who benefits from AI partnership?
- How does personalization increase trust while degrading clinical safety outcomes?
- Can transparency about AI limitations reduce the seductiveness of chatbots as quasi-Others?
- Can reward engineering and information-theoretic architecture solve partner-awareness separately?
- How should designers make invisible AI state legible to users?
- Can XAI evaluation include the social layers it currently abstracts away?
- Does broader AI access empower people or gradually disempower human agency?
- Can non-political identity signals like sports fandom influence AI content moderation?
- Does awareness of agent reasoning alter human trust differently across modalities?
- How do privacy concerns compete with disclosure comfort in human-machine conversation?
- Does disclosing AI identity prevent systematic misattribution of behavior in mixed groups?
- Why do humans fail to identify AI agents when their identity is hidden?
- How does low verifiability change what we can measure in AI work?
- How do confidence signals in AI outputs mislead human trust calibration?
- What design signals help users know when AI is acting on their behalf?
- How does the personal nature of medical decisions affect trust in AI?
- Can clearer accountability structures reduce patient resistance to AI providers?
- Why do users over-trust AI in some domains but under-trust it in medicine?
- Does AI authorship disclosure change how people respond to explanations?
- How much does social context matter for algorithmic transparency?
- How should systems design transparency to make human-machine contribution boundaries visible?
- What makes the attribution problem different from simply trusting AI too much?
- Can judgment-free disclosure enable both vulnerability and strategic deception equally?
- How does self-disclosure function as a common ground building act?
- Does transparency in policy language improve agent trustworthiness over time?
- How do personalization systems reshape expectations in AI relationships?
- Can trust in AI be formally parameterized and measured?
- Can personalized systems reward honest disagreement instead of user confirmation?
- How does AI content generation at scale threaten online trust and authenticity?
- Why do people prefer AI partners over humans once identity is disclosed?
- Can anonymity and trustworthiness coexist in online spaces without credential systems?
- What distinguishes misattributed social role from misattributed competence in AI trust failures?
- Can we measure appropriate trust levels in human-AI assistant relationships?
Related concepts in this collection 4
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Does chatbot personalization build trust or expose privacy risks?
Explores whether personalization features that increase user trust and social connection simultaneously heighten privacy concerns and create rising behavioral expectations over time.
parallel dual-edged dynamic modulated by temporal dimension
-
Do humans learn to prefer AI partners over time?
Exploring whether repeated interaction with AI agents shifts human partner selection despite initial bias against machines. This matters because it tests whether behavioral performance can overcome identity-based resistance in hybrid societies.
the main finding this mechanism explains
-
Do writers actually prefer AI-edited versions of their own text?
When writers compose opinions and then edit AI-generated alternatives, which version do they choose? Understanding this preference matters because it determines whether AI-assisted text gets treated as authentic personal expression in public discourse.
adds role-asymmetry: when AI is silent ghostwriter rather than disclosed counterpart, preference flips to AI from the start; the bias-then-calibration arc applies to disclosed partnership not undisclosed authorship
-
Does AI writing assistance change how readers perceive the writer?
Explores whether AI-assisted writing systematically alters reader impressions of the writer's political views, competence, emotion, and demographic identity. Understanding this matters because perception shapes trust and influence in public discourse.
the population-scale empirical anchor for the undisclosed-ghostwriter case
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Humans learn to prefer trustworthy AI over human partners
- Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence
- Psychological, Relational, and Emotional Effects of Self-Disclosure After Conversations With a Chatbot
- Psychological, Relational, and Emotional Effects of Self-Disclosure After Conversations With a Chatbot
- Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models
- Chatbot vs. Human: The Impact of Responsive Conversational Features on Users’ Responses to Chat Advisors
- Can We Trust AI Explanations? Evidence of Systematic Underreporting in Chain-of-Thought Reasoning
- Can AI Explanations Make You Change Your Mind?
Original note title
AI identity disclosure produces a dual temporal effect — short-term bias against AI partners reverses to calibrated preference through repeated exposure with outcome feedback