SYNTHESIS NOTE
Psychology, Society, and Alignment

Can AI personas reliably replicate human experiment results?

Exploring whether LLM-based persona simulations accurately reproduce experimental findings from published psychology and marketing research, and what factors determine when they succeed or fail.

Synthesis note · 2026-02-22 · sourced from Personas Personality
What kind of thing is an LLM really? How should researchers navigate LLM reasoning research?

The Viewpoints AI study systematically replicated 45 experiments from 14 Journal of Marketing articles (2023-2024), creating unique AI persona instances matching original sample sizes and demographics. Each persona received the exact stimuli and measures from the original study.

Results by evidence strength:

The p-value correlation is the key finding: LLM persona simulations function as a noisy amplifier of existing evidence. Strong effects register clearly; weak effects are in the noise floor. This means persona simulation is useful for confirming robust effects but unreliable for detecting subtle ones — precisely the effects that matter most for advancing theory.

The efficiency argument is compelling regardless: studies that took weeks can be run in minutes, potentially during a single meeting. For applied contexts — pretesting health PSAs, ad variants, social media posts — 76% main effect replication with instant turnaround may be sufficient.

However, the 24% failure rate on main effects (roughly 1 in 4 significant findings producing no difference with AI personas) means ground truth determination is unresolved. Are the human results or the AI results more representative? Since human subjects studies carry their own biases (gender, race, age, cultural context), and LLMs are trained on data containing those same biases, neither can claim definitional accuracy.

Inquiring lines that use this note as a source 54

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 4

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
13 direct connections · 100 in 2-hop network ·medium cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

LLM persona simulations replicate 76 percent of published experimental main effects but accuracy tracks original evidence strength — marginal effects are unreliable