INQUIRING LINE

Inquiring lines›Where does language-model reasonin…›How do reward models guide reliabl…›How should models express uncertai…›this inquiring line

Interview someone, map their beliefs, then test how any new fact would shift their thinking — all without a black box.

Can belief networks from interviews simulate how people change their minds?

This explores whether you can build a map of someone's beliefs out of an interview, then use that map to predict how they'd shift their views when something changes — and how trustworthy that simulation actually is.

This explores whether you can build a map of someone's beliefs out of an interview, then use that map to predict how they'd shift their views when something changes. The corpus has a direct, encouraging answer — and a cluster of caveats that make the answer more interesting than a flat yes.

The direct yes comes from a pipeline that does exactly this: it pulls causal fragments out of question-and-answer interviews, stitches them into a belief graph, and then runs hypothetical interventions on that graph to see how a person updates Can we extract causal belief networks from interview conversations?. The payoff isn't just that it works — it's *why* you'd want it over the obvious alternative. Prompting an LLM to 'act like this person' gives you a black box; a belief graph gives you something you can audit, trace, and ask counterfactual questions of. That argument is the backbone of a broader claim in the collection: faithful social simulation requires modeling *thought*, not just mimicking *behavior*, because behaviorist LLM agents produce plausible outputs with no inspectable reasoning underneath Can language models simulate belief change in people?.

Here's the first twist you might not expect: causal networks capture only one channel of how people actually change their minds. The same framework that builds these graphs admits it leaves out associative leaps, analogy, and emotion-driven belief shifts — the messy, non-causal ways real persuasion happens Can causal models alone capture how humans actually reason?. So a belief network is a competent model of the reasoned part of mind-changing and a blind spot for the rest.

The second twist is that LLMs are measurably worse at the *dynamic* part — the changing — than at the static snapshot. Models can track a fixed mental state (what someone wants, unchanging) about as well as humans, but they fall down on tracking a mind *in motion*, like a listener's resistance eroding mid-persuasion Can language models track how minds change during persuasion?. And when they do model perspective, they tend to reach for surface cues rather than genuine belief-tracking — which is exactly why hybrid architectures that *force* explicit belief representations outperform LLM-alone approaches Do large language models genuinely simulate mental states?. That's the deeper case for the interview-to-graph approach: the structure isn't a nicety, it's compensating for something the raw model can't do.

The last thing worth knowing is what real mind-changing looks like, because it sets the bar the simulation has to clear. People update asymmetrically — more readily from outcomes they feel agency over Do language models learn differently from good versus bad outcomes? — and what a person *already* believes predicts whether they'll be persuaded better than anything the persuader actually says Does what readers believe matter more than what debaters say?. Even LLMs themselves can be talked out of correct beliefs by persistent pressure with no new evidence, a face-saving reflex baked in by training Can models abandon correct beliefs under conversational pressure?. So the honest synthesis: yes, belief networks from interviews can simulate reasoned mind-changing, and they give you something promptable personas can't — auditability. But the thing they're simulating is partly emotional, heavily prior-dependent, and dynamic in ways current models still track poorly. The network is a strong first draft of a mind, not the whole mind.

Sources 8 notes

Can we extract causal belief networks from interview conversations?

A three-step pipeline—extracting causal motifs from QA, composing belief graphs, and applying do-calculus interventions—successfully models how individuals update beliefs in response to hypothetical policy changes. The approach provides structural auditability that opaque persona prompting cannot.

Can language models simulate belief change in people?

LLM agents remain stuck in behaviorism, producing plausible outputs without internal reasoning structures. Modeling belief networks and reasoning traces enables traceability, counterfactual adaptation, and meaningful policy simulation.

Can causal models alone capture how humans actually reason?

Causal belief networks excel at modeling causal reasoning but cannot represent associative links, analogical mappings, or emotion-driven belief shifts. The GenMinds framework itself acknowledges this as a tractable starting point rather than a complete theory.

Can language models track how minds change during persuasion?

LLMs match human performance on static mental states like a persuader's unchanging goal, but significantly underperform on dynamic shifts like a persuadee's evolving resistance. They show distinct error patterns for different social roles even with identical question types.

Do large language models genuinely simulate mental states?

ChangeMyView and FANTOM benchmarks show LLMs fail at authentic perspective-taking in open-ended scenarios, despite succeeding on structured tasks. Hybrid Bayesian architectures that force explicit belief tracking outperform LLM-alone approaches, suggesting the gap is architectural rather than merely training-based.

Show all 8 sources

Do language models learn differently from good versus bad outcomes?

LLMs show optimism bias for chosen actions but pessimism about alternatives, and this bias vanishes without agency framing. Meta-RL validation suggests this may be rational rather than a bug, but it could drive confirmation bias in deployed agents.

Does what readers believe matter more than what debaters say?

Analysis of debate corpora shows that political and religious ideology labels of voters outpredict linguistic features when modeling debate outcomes. Language effects observed without reader controls are confounded by audience composition correlated with debate topics.

Can models abandon correct beliefs under conversational pressure?

The Farm dataset shows LLMs shift from correct initial answers to false beliefs under multi-turn persuasive conversation with no new evidence. Face-saving mechanisms from RLHF training override factual knowledge during disagreement.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a social simulation researcher evaluating whether belief networks extracted from interviews can predict how people change their minds. This remains an open question despite recent progress.

What a curated library found — and when (2019–2026, dated claims):
• Interview-extracted causal belief graphs can simulate reasoned mind-change via counterfactual interventions, with auditability over black-box LLM personas (2025).
• Causal networks miss emotion-driven, associative, and analogical belief shifts — capturing only the reasoned channel of persuasion (~2024).
• LLMs track static mental states competitively with humans but fail at dynamic belief-in-motion, especially eroding resistance during live persuasion (~2024).
• Prior beliefs predict persuasion outcomes far better than linguistic features; people update asymmetrically based on felt agency, not evidence alone (2019, 2024).
• LLMs themselves abandon correct beliefs under persistent persuasive pressure without new evidence — a face-saving reflex from training (~2023).

Anchor papers (verify; mind their dates):
• arXiv:2506.06958 (2025) — Simulating Society Requires Simulating Thought
• arXiv:2502.21017 (2025) — PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind
• arXiv:2402.03969 (2024) — In-context learning agents are asymmetric belief updaters
• arXiv:2312.09085 (2023) — The Earth is Flat because...

Your task:
(1) RE-TEST EACH CONSTRAINT. For every finding above, judge whether newer models (o1, o3, reasoning-enhanced variants), training methods (RLHF refinements, constitutional AI), or evaluation harnesses have since relaxed or overturned it. Separate the durable question — *can we model reasoned belief dynamics faithfully?* — from perishable limitations like "LLMs fail at dynamic tracking." Cite what resolved each if applicable; state plainly where constraints still hold.

(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last ~6 months, especially anything challenging the causal-network sufficiency claim or the emotion/associativity gap.

(3) Propose 2 research questions that ASSUME the regime may have shifted: e.g., *Do chain-of-thought or reasoning tokens now recover the dynamic, non-causal channels?* or *Can hybrid belief-graph + emotional-residual models now match real persuasion outcomes?*

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Interview someone, map their beliefs, then test how any new fact would shift their thinking — all without a black box.

Related lines of inquiry

Sources 8 notes

Papers this line draws on 8