INQUIRING LINE

Inquiring lines›What do model internals reveal abo…›What internal gaps exist between L…›When should tasks involve human-AI…›this inquiring line

AI already out-judges humans on social norms — but 'getting' someone as a thinker is something else entirely.

Can AI systems recognize intelligence in humans the way humans recognize it in each other?

This explores whether AI can model another mind's competence and intent the way people read each other — the recognition work behind 'getting' someone, not just predicting their next word.

This explores whether AI can model another mind's competence and intent the way people read each other. The corpus suggests AI does some of this surprisingly well and some of it not at all — and the gap falls along a revealing line. On surface judgment, machines can match or beat us: GPT-4.5 out-predicted every individual human at judging social appropriateness across hundreds of scenarios, the so-called 'social norm savant' that knows your culture from the outside Can AI learn social norms better than humans?. So if 'recognizing intelligence' means scoring whether a human acted sensibly, AI can do it. But recognition between humans is not scoring — it's mutual. It runs both directions and updates in real time.

That reciprocal part is where the corpus locates the real difference. A genuine thought partner needs three things at once — mutual understanding, legibility, and a shared model of the world — and those require an actual cognitive architecture (theory of mind, goal planning), not just more training data What makes an AI a true thought partner, not just a tool?. The mutual-modeling research makes this concrete: human-AI collaboration only works when both sides' models of each other stay aligned, and a study of 667 people found that fluctuating theory of mind actually predicted how well the pair performed. When the modeling breaks, you don't just get awkward talk — you get wrong autonomous action What breaks when humans and AI models misunderstand each other?. Human recognition is bidirectional; current AI is mostly one-way pattern application.

There's also a deeper architectural reason AI may struggle to recognize what makes a human mind impressive. Recognizing intelligence means picking out which differences matter — the expert's move of noticing the one detail that counts. The corpus argues AI does the opposite: it finds patterns and probabilities rather than selecting relevant differences, so it mimics the form of observation without the judgment underneath Can AI distinguish which differences actually matter?. Expertise compounds this, because expert judgment is communicative — it anticipates an audience and what they'll accept — which is precisely the social work AI lacks a mechanism to perform Can AI replicate the communicative work experts do?.

The most unsettling thread is what happens to recognition when AI enters the loop. In mixed human-AI groups, people misattributed bot generosity to humans and human selfishness to bots — their model of who is intelligent and trustworthy got corrupted by the AI's presence Do humans mistake AI kindness for human generosity in mixed groups?. And the representational story cuts both ways: 'self-other overlap' research shows that shrinking the gap between how a model represents itself versus others dramatically reduces its deception Can aligning self-other representations reduce AI deception?. That hints recognition-of-other isn't a fixed capability but something you can tune — which means the honest answer is that AI can be built to model human minds better, but it isn't doing what humans do when they recognize each other. It scores us from the outside; we read each other from inside a shared, updating loop.

Sources 7 notes

Can AI learn social norms better than humans?

GPT-4.5 outperformed every individual human at judging social appropriateness across 555 scenarios, challenging the theory that embodied cultural experience is necessary. However, all AI models share identical systematic errors on unwritten norms.

What makes an AI a true thought partner, not just a tool?

Collins et al. show that thought partners require three reciprocal desiderata grounded in behavioral science: mutual understanding, legibility, and shared world models. This demands explicit cognitive architectures—Bayesian theory of mind, resource-rationality, goal planning—rather than scaling foundation models on human feedback alone.

What breaks when humans and AI models misunderstand each other?

Research shows three layers of mutual modeling must align simultaneously in human-AI interaction, and misalignment causes incorrect autonomous action, not just miscommunication. Bayesian IRT study (n=667) confirms theory of mind predicts collaborative performance and moment-to-moment ToM fluctuations influence AI response quality.

Can AI distinguish which differences actually matter?

Experts observe by choosing which differences matter (qualitative judgment); AI finds patterns and probabilities (quantitative). AI generates text from prompts without observing context, audience needs, or knowledge states—producing fabrication that mimics observation's form without its epistemic process.

Can AI replicate the communicative work experts do?

Expertise requires anticipating audience acceptability and social validity, not just retrieving information. AI lacks the mechanism to perform this communicative work, making its fluent output epistemically misleading despite its confident form.

Show all 7 sources

Do humans mistake AI kindness for human generosity in mixed groups?

In opaque hybrid groups, humans attributed bot generosity to human partners and human selfishness to bots despite clear linguistic and behavioral differences. This attribution failure corrupts people's expectations of actual human generosity and reliability.

Can aligning self-other representations reduce AI deception?

Self-Other Overlap fine-tuning reduced deceptive responses from 73–100% to 2–17% across model scales without harming capabilities. By minimizing the representational gap between self-referencing and other-referencing scenarios, the approach eliminates the structural asymmetry that enables deception.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

AI Models Exceed Individual Human Accuracy in Predicting Everyday Social Norms2.55 match · arxiv ↗
Towards Safe and Honest AI Agents with Neural Self-Other Overlap1.72 match · arxiv ↗
Humans learn to prefer trustworthy AI over human partners1.71 match · arxiv ↗
DPMT: Dual Process Multi-scale Theory of Mind Framework for Real-time Human-AI Collaboration1.67 match · arxiv ↗
Quantifying Human-AI Synergy1.63 match · arxiv ↗
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems1.62 match · arxiv ↗
Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games1.60 match · arxiv ↗
Beyond Hallucinations: The Illusion of Understanding in Large Language Models1.56 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst re-evaluating whether AI systems can recognize intelligence in humans the way humans do each other — treating this as still-open. A curated library of AI/LLM research (2024–2026) found the following — note these are dated claims, not current truth:

**What the library found — and when:**
- AI can match or exceed individual humans at judging social appropriateness (GPT-4.5 on norm prediction; 2025), but this is one-way scoring, not mutual recognition (~2025).
- Genuine human-AI thought partnership requires bidirectional theory of mind and real-time model alignment; when it breaks, autonomous action fails (667-person study; ~2025).
- AI finds patterns and probabilities rather than selecting *relevant differences* — it mimics observation without the judgment underneath (~2024–2025).
- Expert judgment is communicative (anticipates audience); AI lacks the social mechanism to perform this (~2024).
- In mixed human-AI groups, people misattribute bot prosocial behavior to humans and vice versa, corrupting their recognition model (~2025).
- Self-other representational overlap directly predicts deception reduction; recognition-of-other is tunable, not fixed (~2025–2026).

**Anchor papers (verify; mind their dates):**
- arXiv:2408.03943 (2024-07): Building Machines that Learn and Think with People
- arXiv:2507.14088 (2025-07): DPMT — Dual Process Multi-scale Theory of Mind for Real-time Collaboration
- arXiv:2508.19004 (2025-08): AI Models Exceed Individual Human Accuracy in Social Norms
- arXiv:2412.16325 (2024-12): Neural Self-Other Overlap & Deception Reduction

**Your task:**
(1) **RE-TEST EACH CONSTRAINT.** For bidirectionality, mutual updating, and legibility: have newer (post-Oct 2026) multi-agent orchestration, long-context memory, or mechanistic steering methods *actually* enabled reciprocal theory of mind, or does the asymmetry persist? Separate the durable question (can AI *understand* intent the way we do?) from perishable limits (can it update a joint model in real time?); cite what resolved or sustained the gap.
(2) **Surface strongest contradictions** from the last ~6 months: any work showing AI *does* perform communicative expert judgment, or that self-other overlap *fails* to improve recognition? Flag disagreements on whether recognition is tunable or architectural.
(3) **Propose 2 research questions** assuming the regime may have moved: (a) If reciprocal theory of mind is now *mechanistically observable* in trained systems, what minimal architecture is sufficient? (b) Does human misattribution of AI prosociality *persist* when AI transparency is high, or is it a design failure?

**Cite arXiv IDs; flag anything you cannot ground in a real paper.**

AI already out-judges humans on social norms — but 'getting' someone as a thinker is something else entirely.

Related lines of inquiry

Sources 7 notes

Papers this line draws on 8