SYNTHESIS NOTE

Can AI learn social norms better than humans?

Explores whether large language models can predict cultural appropriateness more accurately than individual humans, and what this reveals about how social knowledge is transmitted and learned.

Synthesis note · 2026-02-22 · sourced from Theory of Mind

Hook: GPT-4.5 is better at knowing what's socially appropriate than any individual human. Not some humans — all of them. 100th percentile. But it makes mistakes that every other AI model also makes in the same way.

The finding:

555 everyday scenarios. "How appropriate is it to laugh at a job interview?" "To cry on a bus?" "To read in church?" When asked to predict the average human judgment, GPT-4.5 was more accurate than every single human participant. Replicated with Gemini 2.5 Pro (98.7%), GPT-5 (97.8%), Claude Sonnet 4 (96.0%).

The AI doesn't just know the rules. It knows the collective sense of a culture better than the people living in it.

Why this matters:

The dominant theory in cognitive science says social norms require embodied experience — you learn what's appropriate by living in a culture, reading faces, feeling social consequences. Statistical learning over text shouldn't be enough. But it is. "Sophisticated models of social cognition can emerge from statistical learning over linguistic data alone."

Language turns out to be a "remarkably rich repository for cultural knowledge transmission." Everything humans write — from etiquette guides to Reddit arguments to novels — encodes social norms. The AI has read more of this than any human could experience in a lifetime.

The catch:

All models show "systematic, correlated errors." Not random mistakes — structured blind spots that every AI architecture shares. The same scenarios that trip up GPT-4.5 also trip up Gemini and Claude. This pattern "indicates potential boundaries of pattern-based social understanding."

There are aspects of social norms that don't make it into text. The unwritten rules that communities enforce through glances, silences, and physical presence. The norms that are so obvious nobody bothers to articulate them. These are the correlated blind spots — and they're exactly the norms you most need to get right in practice.

The tension:

The AI is a savant — extraordinary competence in one dimension (predicting collective norms from text) combined with systematic gaps in another (the norms that never get written down). Better than any individual at the average, blind to the specifics that any local participant would catch immediately.

Flat, not targeted — the post-generation consequence. The savant-from-outside pattern has a specific consequence at the level of generated posts: AI output is flat rather than targeted because no social position is occupied. Normal influencer, commentator, and pundit speech online carries implicit position-taking that situates the speaker relative to the audience — speaking as one of us, or for this community, or against that one. The position-taking is what makes the content addressed to someone in particular, rather than written about a topic in general. AI can predict the average appropriate response but cannot occupy a specific social position vis-à-vis a specific community, because it has no community membership to mark. The output is therefore flat — competent on general norm, absent on the position-taking that would make the post legible as speech from someone to someone. Knowing norms from outside and speaking from outside produce the same residue: content that is addressed to no one in particular and therefore cannot perform the community-specific legitimacy that targeted commentary depends on.

Post structure: Hook (the number) → What it means (embodiment challenge) → The catch (correlated errors) → The tension (savant pattern) → What this means for AI deployment in social contexts

Platform: LinkedIn (300-400 words, practical tone) or Medium (longer with theoretical framing)

Inquiring lines that read this note 72

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

Why do reward structures fail to shape long-term agent learning?

What cognitive capabilities do agents need to internalize social feedback?

How do aggregate reward models systematically exclude minority user preferences?

Does learning community preferences as training rewards operationalize prediction without participation?

Does AI fluency substitute for verifiable accuracy in human judgment?

Why can't AI models internalize audiences the way human experts do?

Can AI-generated outputs constitute genuine knowledge or valid claims?

How does AI-generated content transformation affect public discourse quality?

How should personalization be implemented to improve AI assistant effectiveness?

Can AI safely personalize within negotiated societal bounds?

Can AI systems develop genuine social understanding without embodiment?

How can language models sustain linguistic synchrony and intersubjectivity during dialogue?

How do language models establish social grounding in human dialogue?

Why should disagreement be treated as signal in collaborative reasoning?

How does communicative standing depend on participation in normative communities?

Is embodied interaction necessary for language meaning and genuine agency?

Do accurate-looking LLM outputs hide structural failures in learning and reasoning?

Can output-layer corrections fix fundamental cultural representation deficits in LLMs?

Why do persona-level simulations fail to predict individual preferences accurately?

How can AI systems learn from failures without cascading errors?

When should tasks involve human-AI partnership versus full automation?

How do multi-agent systems achieve genuine cooperation and reasoning?

Does genuine cooperation require rule-based rather than learned behavior?

How does AI adoption affect human skill development and labor equality?

How should AI systems model human resource constraints and expertise levels?

How should human oversight be integrated with autonomous AI systems?

Can automated systems encode human values as reliably as human workers enforce them?

Do language models learn genuine linguistic structure or just surface patterns?

How do language models inherit human biases from training data?

Why do language models reinforce false assumptions instead of correcting them?

How should conversational agents balance goal-driven initiative with user control?

How can AI alignment serve diverse human preferences at scale?

How do we evaluate AI systems when user perception misleads actual performance?

Why do automated selection methods outperform human judgments of relevant context?

Can debate mechanisms prevent silent agreement on wrong answers in multi-agent reasoning?

How do AI models balance competing social goals simultaneously?

How can identical external performance mask different internal representations?

Why do standard social regularization methods miss the actual value networks provide?

How do formal dialogue structures reveal conversation coherence mechanisms?

What social information is missing from language data?

Is model self-awareness based on genuine introspection or pattern matching?

How do neural self-other representations affect AI deception and alignment?

What structural factors drive popularity bias in recommendation systems?

How does AI recommendation convergence mirror the hivemind effect in generation?

Does RLHF training sacrifice accuracy and grounding for user agreement?

Does alignment compound cultural bias that started during pretraining?

How do professional roles and expertise transform with AI-generated content?

Should AI assistants align with role-specific norms rather than user preferences?

Related concepts in this collection 4

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

14 direct connections · 110 in 2-hop network ·medium cluster Open in graph ↗

Can AI learn social norms better than humans? Can AI systems learn social norms without embodied… What makes linguistic agency impossible for langua… Can LLMs acquire social grounding through linguist… Can AI agents learn people better from interviews …

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Can AI systems learn social norms without embodied experience? Large language models exceed individual human accuracy at predicting collective social appropriateness judgments. Does this reveal that embodied experience is unnecessary for cultural competence, or do systematic AI failures point to limits of statistical learning?
primary evidence
What makes linguistic agency impossible for language models? From an enactive perspective, does linguistic agency require embodied participation and real stakes that LLMs fundamentally lack? This matters because it challenges whether LLMs can truly engage in language or only generate text.
the theory being challenged
Can LLMs acquire social grounding through linguistic integration? Explores whether LLMs gradually develop social grounding as they become embedded in human language practices, analogous to child language acquisition. Tests whether grounding is a fixed property or an outcome of participatory use.
complicates the trajectory: maybe grounding is already sufficient for norm prediction
Can AI agents learn people better from interviews than surveys? Can rich interview transcripts seed more accurate generative agents than demographic data or survey responses? This matters because it challenges how we build digital simulations of real people.
complementary evidence for the post angle: social norm prediction at 100th percentile + interview-based response replication at 85% demonstrate text-based learning approximates embodied social knowledge across different task dimensions

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

the social norm savant — ai knows your culture better than you do but from the outside

Can AI learn social norms better than humans?

Inquiring lines that read this note 72

Related concepts in this collection 4

Related papers in this collection 8

Search by related questions 5